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DEDICATION 


Dr  Raja  Ramanna  attained  the  age  of  60  on  28  January  1985.  He  is  well 
known  to  the  scientific  community  in  India  and  abroad  for  his  dedicated 
contributions  in  various  fields  of  scientific  activity  and  especially  to  the 
atomic  energy  programme  of  the  country.  He  has  been  closely  connected 
with  the  Indian  Academy  of  Sciences  and  had  served  on  its  Council  for 
several  years.  He  was  also  President  of  the  Indian  National  Science  Academy 
and  the  Indian  Physics  Association.  These  three  organisations  jointly  publish 
Pramana  (a  journal  of  physics).  It  is  therefore  most  appropriate  that  this 
special  issue  of  the  journal  is  brought  out  to  commemorate  the  60th  birthday 
of  Dr  Ramanna. 

Ramanna  was  born  in  Mysore  and  had  his  early  education  at  Bangalore  and 
later  at  the  Madras  Christian  College,  Madras.  After  obtaining  his  B.Sc. 
(Hons)  degree  in  Physics  from  Madras  Christian  College  in  1 945,  he  went  to 
Kings  college,  London  and  took  his  Ph.D.  under  Professor  Allan  May  in 
1949.  He  joined  the  Tata  Institute  of  Fundamental  Research  the  same  year.  In 
1 953  he  was  transferred  to  the  Atomic  Energy  Establishment,  Trombay  (the 
present  barc)  as  the  head  of  the  Nuclear  Physics  Division.  He  continued  to  be 
a  member  of  the  tifr  Faculty.  In  1962  he  was  appointed  Director  of  the 
Physics  Group  in  barc  and  in  1972  he  was  appointed  Director  barc  and 
member  for  Research  and  Development  in  the  Atomic  Energy  Commission.  In 
1973  he  was  appointed  as  part-time  Chairman  of  Bharat  Electronics  Ltd., 
Bangalore.  In  1978  he  became  Scientific  Adviser  to  Defence  Minister; 
Secretary  to  the  Government  of  India,  Defence  Research;  and  Director 
General  Defence  Research  and  Development  Organisation.  In  January  1 981 
he  returned  to  barc  as  Director  and  Secretary  to  Government  of  India  in  the 
Department  of  Atomic  Energy  and  Member  for  Research  and  Development, 
Atomic  Energy  Commission.  In  1983  he  was  appointed  Chairman,  Atomic 
Energy  Commission. 

Dr  Ramanna  has  actively  participated  in  the  design  of  India's  first  reactor 
apsara,  and  was  in-charge  of  the  utilisation  of  research  reactors  apsara  and 
emus  and  the  5-5.  MeV  Van  de  Graaff  Accelerator.  He  was  the  Chairman  of  the 
Variable  Energy  Cyclotron  Committee  constituted  to  set  up  a  cyclotron  at 
Calcutta  and  he  was  also  the  Chairman,  Planning  and  Co-ordination 
Committee  of  the  Reactor  Research  Centre  at  Kalpakkam  for  planning  and 
implementing  a  fast  reactor  programme  in  India. 

Ramanna  has  made  outstanding  contributions  in  several  areas  of  nuclear 
science.  This  includes  neutron  thermalisation,  reactor  design,  experimental 
and  theoretical  studies  of  low  energy  nuclear  reactions  with  special  reference 
to  nuclear  fission.  His  research  work  in  the  area  of  nuclear  fission  is 
particularly  well-known.  Under  his  leadership,  pioneering  work  has  been 
carried  out  in  the  studies  pertaining  to  the  emission  of  prompt  neutrons, 


gamma  rays,  K  x-rays  and  light  charged  particles  in  fission.  Extensive  studies 
of  the  fragment  mass,  kinetic  energy  and  angular  distribution  and  the 
correlations  between  them  both  in  the  case  of  thermal  and  fast  neutron 
induced  fission  have  also  been  carried  out.  He  has  also  made  important 
contributions  to  the  development  of  gas  ionization  cum  scintillation  detec¬ 
tors  for  the  experimental  research  programmes  in  the  areas  of  fission  physics. 
In  his  theoretical  research  work,  his  outstanding  contribution  has  been  the 
development  of  the  stochastic  theory  of  fission  based  on  the  nucleon 
exchange  mechanism.  The  importance  of  this  mechanism  in  the  heavy  ion 
reactions  also  is  now  well  recognised.  Ramanna's  role  in  other  areas  of 
peaceful  utilization  of  nuclear  energy  are  well  known. 

Another  important  contribution  of  Dr  Ramanna  is  his  strategy  of  organis¬ 
ation  of  science  in  developing  countries.  He  has  been  Chairman  of  the  'nora' 
Committee  of  the  International  Atomic  Energy  Agency,  which  looked  after 
joint  projects  in  Norway  on  Reactor  Physics,  and  the  India- Philippines- 
Agency  Project  where  for  the  first  time  a  successful  co-operative  effort  of 
Asian  scientists  was  organised.  Currently  he  is  a  Member  of  the  Scientific 
Advisory  Committee  to  the  Director-General,  International  Atomic  Energy 
Agency. 

As  Member  of  the  National  Committee  on  Science  and  Technology  he 
made  available  to  the  nation  his  deep  understanding  of  the  problems  of 
development  of  science  and  technology.  He  is  a  member  of  the  newly 
constituted  Scientific  Advisory  Committee  to  the  Cabinet. 

Among  the  numerous  honours  he  has  received  include  the  Shanti  Swarup 
Bhatnagar  Memorial  Award  for  physical  sciences  1 963,  Padma  Shri  in  1 968, 
Padma  Bhushan  in  1973  and  Padma  Vibhushan  irv1975.  He  was  elected  a 
Fellow  of  the  Indian  Academy  of  Sciences  in  1 966  and  a  Fellow  of  the  Indian 
National  Science  Academy  in  1973.  He  was  awarded  the  Meghnad  Saha 
Medal  of  the  Indian  National  Science  Academy  for  1984.  He  was  the 
President  of  the  Physics  Section  of  the  Golden  Jubilee  Session  of  the  Indian 
Science  Congress.  He  played  a  leading  role  in  the  starting  of  the  Indian 
Physics  Association  and  supported  it  with  great  vigour.  Dr  Ramanna  is 
welcomed  in  scientific  gatherings  not  only  for  his  technical  expertise  but  also 
for  the  witty  remarks  and  the  charming  manner  with  which  he  conducts  the 
meetings.  Those  who  are  fortunate  to  know  him  well  are  aware  of  his  skill  in 
music,  especially  with  piano. 

On  the  occasion  of  Dr  Ramanna  attaining  the  age  of  60  it  is  a  matter  of  great 
pleasure  to  convey  our  best  wishes  on  behalf  of  the  Indian  Academy  of 
Sciences,  the  Indian  National  Science  Academy  and  the  Indian  Physics 
Association  and  on  behalf  of  the  scientific  community  for  many  years  of 
creative  scientific  activity  and  happy  life. 

Bangalore  E.  S.  Raja  Gopal 

February  1985  Chairman,  Editorial  Board 
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Quantum  supersymmetric  generalisation  of  Bogomolnyi  bounds 
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Abstract.  We  review  here  classical  Bogomolnyi  bounds,  and  their  generalisation  to 
supersymmetric  quantum  field  theories  by  Witten  and  Olive.  We  also  summarise  some  recent 
work  by  several  people  on  whether  such  bounds  are  saturated  in  the  quantised  theory. 

Keywords.  Supersymmetry;  gauge  theories;  quantum  solitons;  monopoles. 
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1.  Introduction 

Bogomolnyi  published,  in  1976,  a  very  interesting  paper  (Bogomolnyi  1976)  on  some 
aspects  of  classical  topological  soliton  solutions  of  several  relativistic  field  theories.  His 
procedure  was  generic  in  nature  and  covered  such  examples  of  solitons*  as  Kinks  in  two 
(space-time)  dimensions,  vortices  and  flux  lines  in  3  dimensions  as  well  as  the  famous 
monopole  solution  in  4  dimensions.  The  thrust  of  his  results  was  two-fold.  Firstly  he 
showed  that  the  energy  of  any  finite-energy  field  configuration  in  these  systems  was 
bounded  from  below  by  its  topological  index,  apart  from  a  known  numerical  factor. 
Secondly,  on  requiring  that  this  bound  is  saturated,  one  attains  first  order  partial 
differential  equations — far  simpler  to  solve  than  the  parent  field  equations  which  are 
second  order  non-linear  partial  differential  equations.  The  familiar  solitons  like  the 
Kink,  the  flux  lines  and  the  monopole  can  be  obtained  as  solutions  of  these  first-order 
equations.  (It  must  be  mentioned  that  the  basic  trick  used  by  Bogomolnyi  had  been 
employed  slightly  earlier,  in  the  context  of  instantons,  by  Belavin  et  al  1975). 

Bogomolnyi’s  work  was  at  the  classical  level.  Meanwhile,  methods  have  been 
developed  during  the  past  decade  for  “quantising”  solitons,  to  yield  quantum  soliton 
particles  in  the  corresponding  quantum  field  theories.  A  detailed  discussion  of  the 
quantisation  of  solitons  is  given  in  the  book  by  the  present  author  (Rajaraman  1982). 
One  finds  that  these  quantum  solitons  possess,  in  a  suitably  generalised  sense,  many  of 
the  remarkable  properties  that  the  classical  solitons  did.  One  can  ask  therefore  whether 


*  Strictly  speaking,  most  of  these  solutions  should  be  called  solitary  waves,  rather  than  solitons.  But  here 
we  follow  the  practice  in  most  of  the  recent  literature  of  using  the  term  solitons  to  also  cover  solitary  waves,  ie 
localised,  non-dispersive  solutions.  Note  however,  that  contrary  to  the  incorrect  feeling  that  still  persists  in 
some  circles,  solitary  waves  are  not  limited  to  (1  +  1)  dimensions.  Especially  after  the  entry  of  particle 
physicists  in  this  field,  many  solitary  wave  solutions  have  been  found  in  higher  dimensions  as  well.  Thus, 
interesting  gauge-theoretic  solutions  such  as  vortices  in  (2+1)  dimensions  and  the  ’tHooft-Polyakov 
monopole  in  (3  +  1)  dimensions  are  legitimate  solitary  waves. 
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Bogomolnyi-type  bounds  can  also  be  obtained  for  quantum  solitons,  relating  their 
exact  quantum  masses  to  their  topological  quantum  numbers,  and  whether  these 
bounds  are  saturated. 

In  an  elegant  and  compact  paper,  Witten  and  Olive  (1978)  showed  that  for  quantum 
supersymmetric  extensions  of  some  of  the  models  considered  by  Bogomolnyi,  an  exact 
bound  can  be  obtained  for  the  masses  of  the  quantum  soliton  states.  In  fact,  Witten  and 
Olive  pointed  out  that  the  existence  of  the  soliton  sector  affects  the  v£ry  algebra  of 
supersymmetry  (susy),  introducing  a  central  charge  in  the  algebra  in  that  sector.  This 
central  charge  is  intimately  related  to  the  topological  charge  of  the  soliton  and,  after 
some  manipulations,  yields  a  lower  bound  to  the  masses  of  all  the  states  in  the  soliton 
sector.  Having  obtained  the  bounds,  Witten  and  Olive  also  offered  some  speculations 
on  whether  quantum  solitons  actually  saturate  these  bounds,  as  in  fact  they  do  at  the 
classical  approximation. 

The  question  of  whether  these  bounds  are  saturated  at  the  quantum  level  is  an 
important  one.  For,  if  they  are,  they  would  give  us  some  handle  on  the  exact  quantum 
mechanical  masses  of  these  soliton  particles.  One  must  remember  that  calculating  the 
mass-spectrum  is  one  of  the  primary  goals  of  particle  physics.  Yet  hitherto  no  one  has 
been  able  to  calculate  exactly,  the  mass  of  any  massive  bound  state  in  any  non-trivial  4- 
dimensional  field  theory!  We  emphasize  that  these  quantum  bounds  apply  not  just  to 
solitons  in  two  space-time  dimensions  like  the  Kink  or  the  sine-Gordon  soliton,  but 
also  to  some  solitons  in  realistic  four-dimensions  as  well,  such  as  the  (supersymmetric) 
monopole. 

Recent  investigations  by  several  workers  have  pursued  further  the  question  of 
whether  the  Witten-Olive  bounds  are  saturated  and  the  related  questions  of  whether 
the  soliton  mass  and  its  topological  index  each  receive  quantum  corrections  in  these 
susy  models.  The  detailed  results,  as  we  will  see,  vary  with  the  models. 

In  this  article,  we  review  the  developments  mentioned  above.  This  is  strictly  a 
review — prepared  specifically  for  the  special  volume  in  honour  of  Dr  Raja  Ramanna. 
There  are  no  new  results  in  this  article,  but  only  a  synthesis  of  results  already  contained 
in  the  references  cited.  Also,  in  order  to  keep  the  length  of  this  article  within  its 
prescribed  bounds,  we  have  to  assume  that  the  reader  is  familiar  with  the  overall 
background  in  which  our  specialised  topic  is  imbedded,  which  includes  the  basics  of 
gauge  theory,  supersymmetry,  classical  and  quantum  solitons,  and  the  ’t  Hooft- 
Polyakov  monopole. 

2.  Classical  Bogomolnyi  bounds 

Bogomolnyi  inequalities  at  the  classical  level  are  well-known.  Our  chief  concern  here  is 
their  quantum  generalisation.  Nevertheless,  for  the  sake  of  completeness,  let  us  quickly 
sketch  the  classical  bounds  and  their  derivation.  We  will  use  as  illustrations  (1  + 1)- 
dimensional  scalar  field  theories  as  well  as  the  (3  +  1)  dimensional  Georgi-Glashow 
model  which  yields  that  famous  soliton,  the ’t  Hooft-Polyakov  monopole  (Georgi  and 
Glashow  1972;  ’t  Hooft  1974;  Polyakov  1974). 

Let  us  begin  with  the  simple  problem  of  a  real  scalar  field  theory  in  (1  +  1) 
dimensions,  with  any  potential  which  has  degenerate  minima  so  that  topological 
solitons  can  arise.  The  Lagrangian  is 


(i) 
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where  without  loss  of  generality,  we  can  take  the  potential  to  be  positive  and  to  vanish 
at  its  absolute  minima.  We  write  this  potential  in  the  form  %S2((f))  for  the  later 
convenience  when  we  supersymmetrise  the  system  in  §3. 

The  classical  energy  functional  associated  with  this  Lagrangian  is 


E  = 


+i  s2(<t>) 


Clearly,  all  finite  energy  configurations  must  approach  one  of  the  zeroes  of  S((fi)  at  the 
spatial  extremities  x  =  —  oo  and  x  —  +  oo.  We  are  interested  in  a  lower  bound  to  the 
energy,  and  therefore  need  to  consider  only  static  configurations,  where  the  (positive) 
kinetic  energy  is  absent.  The  energy  of  static  (time  independent)  fields  (p(x )  is 


E  = 


00 


dx 


-  00 


\(<iy 

dx 


+is2(<p) 


00 


—  00 
*00 


djci(S±sw,)|  T 


dx  S(</>) 
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d0 

dx 


d  xS((j>) 


—  00 


dx 


=  2[D(«(x))];:? 


00  > 


where 


D(4> ) 


d  <p 


(3) 

(4) 


This  is  the  Bogomolnyi  bound  for  this  system  and  sets  a  lower  bound  on  the  energy  of 
any  field  configuration.  The  bound  is  saturated ,  i.e.  the  equality  in  (3)  is  obeyed 
provided  (i)  <p  is  static,  and  (ii)  it  obeys 


d(p/dx  =  ±S(<p) 


Meanwhile,  the  field  equation  arising  from  (1),  for  static  configurations,  is 


d2</> 

dx2 


(5) 

(6) 


The  first  integral  of  this  field  equation  is  just  the  saturation  condition  (5).  Hence  any 
static  classical  solution,  including  the  topological  soliton,  will  satisfy  (5)  and  therefore 
saturate  the  bound  (3).  Thus  we  know,  without  explicitly  finding  the  soliton  solution, 
that  its  energy  will  be  just  Z)(</>2)  ~  £(0i)  where  (j)2 ,  </>i  are  the  boundary  values  of  the 
soliton  at  x  =  +  oo  respectively.  Clearly  this  bound,  depending  as  it  does  only  on 
boundary  values  of  the  field,  is  a  topological  index  of  the  soliton. 

As  an  illustration,  consider  the  kink  solution  of  the  double-well  potential,  where 


The  kink  solution  goes  from  =  -ptjk  at  x  =  -  oo  to  (p2  =  +/Vv^  at  x  =  oo. 
Hence  the  bound  is 
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d«>2)-d«>.)  = 


2  W 

3  A 


(7) 


Meanwhile,  on  explicitly  solving  the  field  equation,  the  kink  solution  has  the  well- 
known  form 


^kink  _ 
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tanh 


L ix 


JX  y/2' 


with  a  classical  energy  (the  classical  kink  mass)  equal  to 


KL 


2y/2fl3 

3  X  ' 


(8) 


This  exactly  equals  the  bound  (7),  as  expected. 

As  a  second  example,  let  us  turn  to  the  Georgi-Glashow  model,  which  is  a  non- 
abelian  (3  +  1)  dimensional  gauge  theory,  with  a  triplet  of  scalar  fields  (j)a  coupled  to  a 
triplet  of  SU(2)  gauge  fields  Aa.  The  Lagrangian  density  is 


se=\  (Djr  (D“<t>Y  -  2  -  C2)2  - 1  FJv 

with  n,  v  =  0,  1,  2,  3;  a  =  1,  2,  3,  (D^f  =  <3„0“  +  geabcAbl,(l>c 
and  F“=8uA“-dyA‘,u+geabcAl’uAcv. 


(9) 


(10) 


Although  this  system  is  more  complicated  than  (1),  Bogomolnyi  showed  that  a  bound 
can  be  obtained  in  a  similar  way.  Consider  again  the  energy  functional  for  static 
configurations,  in  the  A  aQ  —  0  gauge. 


E  =  d3x 


X 


2\2 


kF'jFtj  +  i  +  -  (rr  -  C2) 


00, 


(11) 


(12a) 

(12b) 


with  i,j  =  1,  2,  3.  Finiteness  of  energy  requires,  as 
(t>a<t>a  ->  C2, 

and  Faij,  D, :</>"-»  0. 

Each  field  configuration  permitted  by  the  boundary  condition  (12a)  corresponds  to  a 
mapping  of  a  two-sphere  S2  into  another  S2.  Since  the  second  homotopy  group  n2(S2) 
—  Z,  such  configurations  fall  into  homotopy  classes  characterised  by  an  integer-valued 
index,  given  by 

1 


N 


da,eijk  eabc(<j>°)  (dj$b)  (dk<f>c), 


871 


(13) 


where  $a  =  =  (j>a/C .  Recall  also  the  result  that  a  configuration  with  homotopy 

index  N  carries  magnetic  charge  m  =  N/g.  Fora  detailed  discussion  of  these  well  known 
results,  see  Rajaraman  (1982).  The  energy  in  (1 1)  can  be  written,  after  a  little  algebra,  in 
the  form 


> 


47iC 
9 
4nC 


N  + 


d3x 


i(F“j  -  £ijkDk4>a)2  +  j  (</,>“  -C2)2 


N  =  4nCm 


9 


(14) 
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where  m  is  the  magnetic  charge  of  the  configuration.  This  is  Bogomolnyi’s  inequality  for 
the  Georgi-Glashow  model.  In  the  so-called  Prasad-Sommerfield  limit  (Prasad  and 
Sommerfield  1975),  where  X  -*■  0  with  C  fixed,  the  bound  (14)  is  saturated,  provided 

Faij  =  eijk(Dk(t>)a.  (15) 

A  solution  of  (15),  by  virtue  of  minimising  the  static  energy  in  any  given  N-sector,  will 
also  be  a  classical  solution  of  the  field  equations.  In  fact,  the  well-known  Prasad- 
Sommerfield  solution  for  the  monopole 


coth  ( rgC ) 


</>a  =  CV 
4?  =  —eaibrbl  1 


1 


Qr 


rgC 
rgC 

sinh  rgC 


;  Aq  =  0;  and 


(16a) 


satisfies  (15)  and  lies  in  the  N  =  1  sector.  Correspondingly,  its  classical  mass, 

=  4  nC/g,  (16b) 


saturates  the  bound  (14).  Note  that  (15)  is  a  first-order  equation,  like  (7),  although  the 
parent  field  equations  derived  from  (10)  would  be  of  second  order. 

Similar  results  can  also  be  obtained  for  the  Nielsen-Oleson  vortex  lines,  the  CPn  and 
non-linear  03  models  in  (2-1-1)  dimensions.  Note  that  the  discussion  in  this  section, 
based  on  Bogomoinyi’s  (1976)  paper,  has  been  entirely  at  the  classical  level. 


3.  The  Witten-Olive  bounds 


Witten  and  Olive  (1978)  considered  the  quantised  and  supersymmetric  extension  of  the 
models  discussed  in  the  previous  section.  First  take  the  two-dimensional  example  and 
consider  the  Lagrangian  density 


&  =  i  [  W„</>)2  ~  s2 WO  +  t  ~  5' WOW]4, 


(17) 


where  \p  is  a  Majorana  spinor  in  two  dimensions.  The  two  y-matrices  could  be  taken  as, 
say  y°  =  o2  and  y1  =  -  iox .  S(</>)  is  any  function  of  the  real  scalar  field  (p  which  permits 
topological  solitons.  It  must  have  degenerate  minima.  This  Lagrangian  is  supersym¬ 
metric  under  the  transformation 


+  \f,  tA^  +  (-ov^-sW>)K;  ?-+f  +  i(iyM^l0-s)  (18) 


where  (,  (  are  Grassmann  numbers.  ForS  =  (X/2)1/2  W>2  -C2),  the  system  (17)  is  the 
supersymmetric  extension  of  the  double  well  system  (1). 

The  supercharge  which  generates  the  transformations  (18)  is  also  a  Majorana  spinor 

Q  given  by 


dx(yMdM</>  -I-  iS)y°\l/. 


(19) 


j 


In  our  representation  of  y-matrices,  charge  conjugation  is  just  complex  conjugation. 
Hence  the  Majorana  spinors  Q  and  ip  have  only  real  components: 


(20) 
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with  Qx  =  dx  [(^o0  + ~S^2], 


6: 


djc[(5o</>-5,^)^2+S^i]. 


(21) 


Using  the  canonical  commutation  rules  for  the  fields  <j>  and  ^  it  is  straightforward  to 
check  that 


(Qif-H  +  P, 

(22a) 

(Q2)2  =  H  —  P, 

(22b) 

and 

{61.62}+  =  2f d xSffl ^  =  2 [D(*)]J£ :  * T, 

(22c) 

where  H  and  P  are  the  total  energy  and  momentum  operators  respectively,  for  the 
system  (17),  and  D(</>)  =  jd <j>  S((p).  Equations  (22a,  b)  can  be  recognised  as  part  of  the 
“usual”  susy  algebra  in  2  dimensions.  But  in  the  “usual”  algebra,  the  anticommutator  in 
(22c)  would  have  been  taken  to  vanish,  and  indeed  it  does  vanish  in  the  vacuum  sector, 
where  (p(x)  takes  the  same  value  at  x  —  —  oo  and  x  =  +  oo,  and  so  will  D((p(x)i 

But,  in  the  soliton  sector,  where  </>(oo)  ^  </>(  —  oo),  D(0(oo))  need  not  equal 
D((j>(  —  oo)).  In  fact,  for  systems  we  are  interested  in,  ie  those  which  support  static 
topological  solitons,  £)($( oo))  will  not  equal  D(<j)(  —  oo)).  To  see  this,  note  that  a  static 
topological  soliton  will  go  from  some  $~$1atx  =  —  oo  to  some  other  </>  =  (j>2  at  x 
=  oo,  where  </>!  and  (p2  are  two  neighbouring  distinct  zeroes  of  S((p).  Therefore  and 
<p 2  will  also  be  two  neighbouring  extremes  of  D($),  one  of  them  a  minimum  and  the 
other  the  next  maximum.  Clearly  D(<p  t)  cannot  equal  Z)(</>2).  Thus,  in  the  sector  of  states 
based  on  the  topological  soliton,  the  operator  T  is  non-vanishing,  and  forms  (as  can  be 
verified)  a  central  charge  in  the  susy  algebra.  (See,  however,  remarks  by  Schonfeld 
(1979)  on  the  validity  of  (22)  in  the  face  of  boundary  conditions.) 

The  quantum  version  of  the  bound  (3)  follows  immediately  from  the  algebra  (22).  We 
have 

H=HQl  +  Q22) 


=  i[(Qi  +  Q2)2  +  {Qi>  Q2}  +  ] 

=  i[(Gi±e2)2+r] 

>m  (23) 


This  operator  inequality  is  the  quantum  generalisation  of  (3),  obtained  by  Witten  and 
Olive  for  the  susy  system  (17).  The  last  step  in  (23)  holds  because  (QY  ±  Q2)2  is  a  real 
non-negative  operator.  Equation  (23)  is  an  operator  inequality,  ie  it  holds  for  the 
expectation  values  of  both  sides  taken  between  any  state  0/ the  quantum  field  theory 
associated  with  (17).  In  the  vacuum  sector  <  T>  =  0  for  all  states,  so  that  (23)  is  trivial. 
But  in  the  soliton  sector  it  is  not.  The  quantum  soliton-particle  at  rest  is  the  lowest 
energy  state  in  this  sector.  Let  us  denote  it  by  |  sol  >.  Then  the  full  quantum  mass  of  the 
soliton,  Msol,  obeys 


Msol  =  <  sol  |  // 1  sol  > 

>  i  |  <sol |7'|sol  >  |. 


(24) 
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Note  that  (24)  is  still  in  the  form  of  an  inequality.  We  will  return  later  to  the  question  of 
whether  the  equality  holds. 

A  similar  bound  can  be  obtained  for  the  quantum  supersymmetric  monopole  by 

considering  a  susy  extension  of  the  Georgi-Glashow  model  (9-10).  The  Lagrangian,  in 
(3  +  1)  dimensions,  is 


+  (D^Yj+HDJjy  (0,0/ 

.2 


+  \  Tr  ( O i ,  <t>i] )2  +  ~  eJk  Tr  { [$• >,  <//k] <t> !  +  [$J,  y5i/^‘ ]  4>2 }  (25) 


where  1,  2;  a  =  1,  2,  3  form  a  pair  of  isotriplet  Majorana  spinors,  0"  is  a  scalar 

and  0 5  a  pseudoscalar  isotriplet,  and  F*v  the  gauge  field  tensor  of  the  su(2)  gauge  group. 
We  have  also  used  the  popular  notation  of  representing  isotriplets  0"  and  t j/°  by  matrix 
valued  fields  <f)j  =  (xa/2)(j)aj  and  —  (+72)0“  respectively  in  the  last  two  trace  terms  in 
(25).  The  model  (25)  enjoys  N  —  2  supersymmetry  whose  generators  are 


Qi  =  Tr 


x  +  Sij  {P4>i)  + 


h-^C0i  ,  02]y°y5*Ai (26) 

where  i  =  1,  2  and  the  spinorial  indices  of  Q  and  0  are  suppressed.  When  the 
anticommutators  of  the  <2,  are  evaluated  using  the  canonical  commutation  rules  of  the 
fields,  one  obtains 


where 


{Qh  Qj}+  =  Sii'fPv  +  £ij(U  +  y5  F), 


t/EE 


F  = 


(27) 

(28a) 

(28b) 


Note  that  the  boson  potential  in  (25)  has  the  positive  form  Tr([01?  02])2.  When  <pl 
=  0 2  —  0,  this  vanishes.  Correspondingly,  one  choice  of  the  vacuum  is  the  symmetric 
one,  with  <</>i  >vac  =  <02  >vac  =  However,  Tr  ([0X,  02])2  also  vanishes  if  the 
matrices  t/q  and  02  commute,  ie  if  the  iso-vectors  0“  and  02  are  parallel  in  internal 
space,  regardless  of  the  magnitudes  |0J|  and  \(p2[  This  choice  <0"  >vac  oc  <02)vac,  ' 
with  either  non-zero,  corresponds  to  spontaneous  breaking  of  the  su(2)  gauge 
symmetry.  Now,  consider  the  charges  U  and  V  in  (28).  Both  are  integrals  of  divergences 
and  can  be  written  as  surface  integrals  at  infinity.  In  the  symmetric  case,  <  0 \  >vac 
=  <  0  j  >vac  =  0,  both  0 i  and  02  will  vanish  at  spatial  infinity  for  any  physical  state,  and 
hence  U  —  V  =  0.  Then  (27)  reduces  to  the  “usual”  susy  algebra.  It  is  only  when 
<0“  )Vac  or  (^2  )vac  *s  non-zero  that  the  central  charges  U  and  V  come  into  play. 

The  quantum  Bogomolnyi-type  bound  follows  easily  from  (27).  A  simple  trick,  used 
by  Witten  and  Olive  is  to  exploit  the  fact  that  the  system  (25)  is  also  chirally  invariant. 
Under  a  chiral  rotation,  the  fields  0?  and  02  rotate  into  one  another,  and  therefore  so 
do  U  and  V.  Perform  a  rotation  such  that  <  V  )  =  0.  Take  the  expectation  value  of  (27) 
between  any  energy-momentum  eigenstate  (whether  solitonic  or  otherwise)  at  rest. 
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Then  (27)  gives 

<{Qi,Qi}+>=ZijM  +  eiJ ,y°<U>,  (29) 

where  M  is  the  invariant  mass  of  the  state.  Note  that  Q,  (or  Qj)  is  also  a  spinor  in  Dirac 
space,  with  spinor  index  a  (or  /?)  which  we  had  suppressed  so  far,  in  addition  to  the  index 
i  (or  j)  which  takes  values  1,  2.  Considered  as  an  8  x  8  matrix,  <  {QiaL,  Qjfi}  +  )  is  a  positive 
matrix.  It  will  have  real  non-negative  eigenvalues.  Therefore  so  will  the  matrix 
SijM  +  eyy°<l7  ),  thanks  to  (29).  But  e^y °<  U  )  clearly  has  eigenvalues  ±  <  U  ).  Hence, 

M^|<U>|.  (30) 

This  result  was  obtained  after  setting  <  V  )  =  0  through  a  chiral  rotation.  The  general 
result  is  clearly  a  chirally  invariant  generalisation  of  (30),  viz. 

M2  ^  <  (7  >2  +  <  F>2.  (31) 

This  is  the  Witten-Olive  bound  for  the  susy  Georgi-Glashow  model,  for  the  mass  of 
any  state,  in  terms  of  the  expectation  values  of  U  and  V  in  that  state.  For,  the  case 
<0i  >vac  =  <05  >vac  =  which  corresponds  to  full  unbroken  su(2)  gauge  symmetry, 
<  1/  )  =  <  F>  =  0  for  any  state.  Then  the  bound  (31)  is  trivial.  But,  for  all  symmetry- 
broken  cases,  with  either  <  0“  )vac  or  <  02  )vac  non-zero,  the  bound  (31)  is  non-trivial.  In 
such  cases,  it  is  applicable  not  only  in  the  solitonic  sectors,  but  also  in  the  vacuum  sector. 
That  is,  (31)  can  be  used  to  get  a  lower  bound  on  the  exact  quantum  masses  of  the 
monopoles  and  dyons,  as  well  as  the  vector  bosons  and  the  Higgs  bosons  of  this  susy 
theory.  Thus,  this  result  of  Witten  and  Olive  is  a  truly  remarkable  one. 


4.  Saturation  of  the  bounds 


The  quantum  Witten-Olive  bounds  (23)  and  (31)  reduce,  in  the  appropriate  classical 
limit,  to  the  Bogomolnyi  bounds  discussed  in  §2,  and  are,  furthermore,  saturated 
classically  by  soliton  solutions.  For  the  (1  +  1)  dimensional  scalar  field  case,  it  is  obvious 
that  the  quantum  bound  (23),  where  the  operator  T is  given  in  (22c),  reduces  to  just  the 
Bogomolnyi  bound  (3)  in  the  classical  limit.  This  classical  bound,  as  we  showed  in  §2,  is 
saturated  by  the  soliton.  For  the  (3  +  1)  dimensional  gauge  theory,  the  relationship  of 
the  classical  bound  (14)  to  the  quantum  bound  (31)  may  be  made  more  transparent  as 
follows.  Using  chiral  symmetry  freedom,  we  can  consider  the  case  where  only 
<0i  )vac^  0,  with  <02  >vac  =  0-  Let  the  modulus  |  <05  >vac|  =  C,  with  the  internal 
space  direction  of  <0 \  >vac  arbitrary.  For  all  finite  energy  systems,  topological  or 
otherwise,  1 05 1  must  then  tend  to  C  and  05  must  tend  to  zero  as  |x|  -*  oo.  The  gauge 
group  su (2)  is  broken  down  to  U(l)  by  such  a  vacuum.  The  associated  electromagnetic 
field  Fevm ■  has  the  gauge  invariant  form  (’t  Hooft  1974) 


’em.  _  fa 

tiv  —  £  1  r  Jiv 


1 


gC' 


£abc01  (£\z0l)*  (Dv<t>l)C 


00 


c 


4>\f: 


Hv 


(32) 


OO. 


since  finiteness  of  energy  requires  that  (D^ i)a->0  faster  than  l/|x|3/2  as 
Now,  consider  the  charges  U  and  V  defined  in  (28).  Since  05->Oasx->oo,  we  have, 
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using  Gauss’  theorem, 


U  =  fsd<T'  F°,')  =  d<T‘F ”,m  =  4nCq’ 

V=0  d  <7ii(^teiJkF%)  =  C(j>  d(Tf  $etJkF%m-  =  AnCm, 

Js  Js 


(33a) 


(33b) 


where  q  and  m  are  just  the  total  electric  and  magnetic  charge  operators.  Hence  the 
bound  (31)  can  be  written  as 


M2  >(4nC)2[<qy+  <m>2], 


(34) 


where  <(g)  and  <m)  are  the  expectation  values  of  the  total  electric  and  magnetic 
charges  of  the  state  in  question.  Thus  this  bound  applies  non-trivially  to  particles  which 
carry  either  electric  or  magnetic  charge,  or  both.  Equation  (34)  is  still  the  same  quantum 
bound  as  (31),  but  rewritten  in  terms  of  more  familiar  charges.  In  the  classical  limit, 
when  applied  to  static  topological  configurations  in  the  A0  =  0  gauge  (which  carry  zero 
electric  charge)  it  reduces  to  precisely  the  classical  Bogomolnyi  bound  (14).  As  pointed 
out  in  §2,  the  exact  single-monopole  solution,  in  the  Prasad-Sommerfield  limit, 
saturates  this  classical  bound.  For  states  which  carry  no  magnetic  charge  (these  will  be 
non-solitonic  states,  in  the  N  =  0  homotopy  sector)  the  bound  in  (34)  is  still  useful, 
giving 

M^4nC\(q)\.  (35) 

The  familiar  “quanta”  of  the  fields,  such  as  the  Higgs  boson,  the  massive  Vector  (IF) 
boson  and  the  photon  of  this  theory  come  in  this  category.  Notice  again  that  at  the 
classical  (“tree”)  level,  the  IF  boson  has  electric  charge  q/4n  and  mass  gC ,  thus  saturating 
the  bound  classically.  The  photon  is  a  special  case.  Thanks  to  unbroken  (7(1)  gauge 
invariance,  the  photon  ( q  =  0)  has  exactly  zero-mass  quantum  theoretically,  and  will 
infact  saturate  the  exact  quantum  bound  (35).  For  the  other  particles,  ie  the  monopole, 
the  IF-boson,  and  the  Higgs  boson  of  this  model,  as  well  as  for  the  solitons  of  the  two- 
dimensional  model,  while  precise  quantum  bounds  exist  in  the  form  of  (24)  and  (34), 
their  saturation  at  the  quantum  level  calls  for  further  discussion.  The  rest  of  this  section 
is  devoted  to  this  question. 

Let  us  begin  with  solitons  of  (1  +  1)  dimensional  scalar  field  theories,  whose  quantum 
bound  is  given  in  (24).  There  is  no  rigorous  closed  result  available,  as  far  as  we  know, 
establishing  that  the  quantum  soliton  must  saturate  the  bound.  What  most  people  have 
attempted  is  to  evaluate  separately  the  quantum  corrections  to  both  sides  of  the  bound, 
ie  to  the  soliton  mass  and  to  its  topological  index  T.  Then  one  can  see  if  these  corrections 
equal  one  another.  Of  course  this  is  not  a  completely  definitive  way  of  answering  the 
question.  Quantum  corrections  are  generally  calculated  using  a  semi-classical  loop 
expansion,  in  powers  of  h.  In  practice  such  a  calculation  can  be  done  only  up  to  some 
given  order,  usually  to  order  h.  If  the  two  sides  of  (24)  agree  upto  0(h) ,  that  by  itself  does 
not  guarantee  that  they  will  agree  to  all  orders  in  h.  (See  however  an  ingenious  and 
indirect  argument  by  Witten  and  Olive,  based  on  counting  of  states,  suggesting  that  the 
monopole  does  exactly  saturate  the  quantum  bound.  See  also  Imbimbo  and  Mukhi 
(1984a)  for  the  2-dimensional  soliton).  Of  course,  if  the  two  sides  of  (24)  do  not  equal 
each  other  upto  order  h,  then  we  can  be  sure  that  the  bound  is  not  saturated. 

Let  us  first  consider  calculations  of  the  0(h)  corrections  to  the  soliton  mass.  The 
general  principles  of  soliton  quantisation,  particularly  in  (1  -I- 1)  dimensions,  are  by  now 
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well  known  (for  reviews,  see  Rajaraman  1975;  Jackiw  1977;  Coleman  1977)  so  that 
we  need  not  present  the  details  of  this  calculation  here.  But  there  are  some  special 
features  which  arises  in  susy  models  that  one  should  be  careful  about.  The  soliton’s 
quantum  mass  to  0(h)  is  given  by 


X  en)+M  C.t> 

*B>0  J 


(36) 


where  wj  and  en  are  respectively  the  eigenvalues  of  the  boson  fluctuation  equation  and 
the  fermion  Dirac  equation  in  the  background  of  the  soliton.  Mc  L  is  the  contribution  of 
the  0(h)  renormalisation  counter  term.  For  the  system  (17),  the  eigenvalue  equations 
determining  w2  and  en  are 


+  (S')2  +  SS " 


«/„(*)  =  wiiJx) 


(37) 


-ia—  +  pS'  )i l/„(x)  =  e„il/„(x) 


(38) 


where  S 


■-[ 


d<t> 


S(4>) 


and  S "  = 

~d2S - 

^solC*) 

d(j)2 

0sol  (■*) 


D’Adda  and  Di  Vecchia  (1978)  made  the  following  important  observation  regarding 
(37)  and  (38).  Let  us  write  the  two-component  spinor  {//„  as 


It  will  be  convenient  to  use  the  representation  ft  =  <r2>  a  =  —  .  Then,  upoif  Squaring 

the  Dirac  Hamiltonian  in  (38),  we  get 

(iaidx  +  a2S')2\l/„  =  (  -d2x  +  (S')2  -<r3 

Since  the  soliton  obeys  d</>soJ/dx  =  —S((pso j),  dS'/dx  =  (S")d<£sol/dx  =  —S"S,  (39) 
yields,  for  each  component  of  i )/„, 

(-di  +  (S’)2  +  SS")\l/{n+)  =  e2nV:\  (40a) 

(-d2x+(S')2  -SS")^  =  eW~\  (40b) 


One  can  see  that  thanks  to  supersymmetry  the  upper  component  of  the  spinor, 
obeys  the  same  eigenvalue  equation  as  (37),  obeyed  by  the  boson  fluctuations  r\n.  Thus 
any  solution  of  (37)  would  also  serve  as  a  solution  of  (40a)  with  the  same  eigenvalue  and 
vice  versa.  The  associated  could  then  be  determined  from  the  Dirac  equation  (38)  ie 

^  =  -(idx  +  iS’)iti+).  (41) 

As  evident  from  (41),  this  matching  between  boson  and  fermion  fluctuations  may  not 
hold  for  zero-energy  (en  =  0)  modes,  but  we  need  not  worry  about  these  for  our 
purposes,  since  they  make  no  contribution  to  the  energy  in  (36). 

Upto  this  point,  these  observations  of  D’Adda  and  Di  Vechia  are  interesting  and 
correct.  However,  from  this  they  concluded  that  £wn  —  Zen  =  0  and  that  the  soliton 
receives  no  quantum  corrections  to  its  mass  upto  order  (h).  This  is  not  correct  for  two 
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reasons,  as  pointed  out  by  Schonfeld  (1979)  and  by  Kaul  and  Rajaraman  (1983): 

(i)  The  counter-term  contribution  Mc{  has  been  neglected.  Normal  ordering 
counter  terms  proportional  to  SS "  exist  for  these  models  consistent  with  supersym¬ 
metry  and  make  a  non-zero  (and  in  fact  divergent)  contribution  Mc  t 

(ii)  A  more  subtle  point  is  that  (Ew„  —  Ze„)  does  not  vanish  even  though,  loosely 
speaking,  every  non-zero  eigenvalue  w  of  (37)  also  occurs  as  an  eigenvalue  of  (38)  and 
vice  versa.  The  reason  is  that,  apart  from  a  few  discrete  levels,  the  spectra  of  both  (37) 
and  (38)  are  continuous,  and  the  densities  of  continuum  level  in  the  two  cases  are 
different.  The  simplest  way  to  understand  this  difference  in  density  states  is  to  put  the 
system  in  a  box  of  length  L,  and  later  take  L  -*■  oo.  Then  the  Boson  eigenfunction  rjn(x) 


obeys  boundary  conditions 

tln(~L/2)  —  t]n(L/2),  (42a) 

and  ( “ Li2)  =  (L/2)’  (42b) 

as  appropriate  to  the  second  order  differential  operator  in  (37).  However,  the  fermion 
eigenfunction  \j/„(x)  obeys  the  first  order  Dirac  equation  (38),  and  correspondingly, 
obeys  first  order  boundary  conditions,  but,  for  both  components: 

W  (- L/2)  =  WHL/2),  (43a) 

and  L/2)  =  r^(L/2).  (43b) 

The  condition  (43b)  can  be  reduced,  using  (41),  to  a  condition  on 

\_(dx  +  S')^+)  (*)]-«  =  [K  +  SW'MV  (43c) 


We  can  see  that  although  both  r/„(x)  and  t ^^{x)  obey  the  same  differential  equations 
(37)  and  (40a),  they  must  satisfy  different  boundary  conditions.  The  condition  (42a)  is 
the  same  as  (43a),  but  (42b)  is  not  generally  the  same  as  (43c),  especially  for  topological 
solitons  for  which  S'  —  (dS/d^)^*)  is  not  the  same  at  x  =  ±L/ 2,  as  oo.  The 
eigenvalues  of  differential  operators  are  specified  both  by  the  differential  form  of  the 
operator  as  well  as  the  boundary  conditions.  Therefore,  the  set  of  boson  eigenvalues  wj 
and  fermion  eigenvalues  el  will  in  fact  be  different,  for  any  finite  L,  however  large.  As 
L  -►  oo,  both  spectra  will  merge  into  the  same  continuum,  but  with  different  spectral 
densities.  The  difference  Iw„  -  Ten  will  not  vanish  in  general. 

Once  we  are  aware  of  these  pitfalls,  the  evaluation  of  the  fermion  and  boson  densities 
of  states  pF{E)  and  pB(E)  using  the  correct  boundary  conditions  (42-43),  is  straight¬ 
forward.  One  finds  that  the  fermion  level  density  is  in  fact  the  average  of  the  densities  of 
the  two  equations  (40a)  and  (40b),  as  one  would  expect  from  symmetry  between  the 
upper  and  lower  components  of  the  spinor  \j/n.  Given  the  densities  of  states,  the 
fluctuation  energy  (Iw„  -  Ie„)  =  $EdE{pB{E)  -pF (£))  can  be  calculated,  added  on  to 
the  counter  term  to  obtain  the  quantum  soliton  mass  (see  Schonfeld  1979:  Kaul  and 
Rajaraman  1983;  their  results  have  been  rederived,  using  more  elegant  methods  and 
without  recourse  to  boundary  conditions,  by  Imbimbo  and  Mukhi  1984a). 

To  illustrate  the  results,  consider  again  the  example  of  the  Kink  of  the  susy  double¬ 
well  problem.  For  this  system 


((f)2  -  p2/X), 


(44a) 
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while  the  counter  term  added  to  S  is 


Sc.t.  — 


00 


dk 


L 


o  2n  (k2  +  2/r2)1/2 


+  K 


(44b) 


where  K  is  an  arbitrary  finite  constant.  On  computing  —jl,en  +  Mct  one  finds, 
for  the  0(h)  quantum  Kink  mass, 


Midnk  =  + 


(45) 


where  we  have  eliminated  dependences  on  /x  and  K  by  using  m1  which  is  the 
renormalised  mass,  to  0(h\  of  the  boson  in  that  model.  In  the  classical  limit,  the  second 
0(h)  term  in  (45)  is  absent  and  mx  can  be  replaced  by  the  “tree”  level  boson  mass 
m0  =  (2)l/2n  in  the  double-well  problem.  Then  (45)  reduces  to  the  classical  kink  mass 
M(k^k  given  in  (8).  However  there  is  not  much  meaning  in  calculating  the  difference 
between  Mki^k  and  M(k^k  to  see  “how  much”  quantum  correction  the  kink-mass 
acquires.  In  quantum  field  theory,  thanks  to  ultraviolet  divergences  and  their  removal 
by  subtraction  schemes  which  are  not  unique,  the  relation  between  the  same  physical 
quantity  evaluated  to  different  orders  in  h,  contains  arbitrariness.  For  instance,  the  one- 
loop  renormalised  boson  mass  mx  can  be  evaluated  in  terms  of  the  “bare”  boson  m0 
=  (2)1,2fi  by  standard  perturbation  techniques,  but  the  relation  between  and  m0  is 
arbitrary  upto  a  constant.  Any  attempt  to  write  mx  in  terms  m0  and  insert  it  into  (45)  so 
that  it  may  be  compared  with  the  classical  result  (8),  will  be  fraught  with  ambiguity  (see 
Kaul  and  Rajaraman  1983  for  more  details).  But  (45)  as  it  stands,  is  meaningful.  It  gives 
the  one-loop  quantum  mass  of  the  kink  in  terms  of  one-loop  boson  mass.  One  can  also 
meaningfully  compare  (45)  with  \  |  <  sol  |T|  sol )  |,  also  evaluated  to  one-loop  level,  to  see 
if  the  bound  (24)  is  saturated. 

The  one-loop  correction  to  |<sol|T|sol)  has  been  evaluated  independently  by 
Imbimbo  and  Mukhi  (1984a)  as  well  as  by  Chatterjee  and  Majumdar  (1984).  These  one 
loop  corrections  come  from  two  sources: 

(i)  Recall  that  \T  =  [D^x))]®*,.  Classically,  we  evaluated  this  by  inserting  the 
classical  function  </)  =  </>sol(x).  In  quantum  theory  </>(x,  t)  is  an  operator,  which  we  write 
in  terms  of  the  shifted  field  operator  r](x,  t), 

</>(*,  t)  =  </> sol  (*)  +  */(*.  0- 


Then,  at  each  point  (x,  r), 


<sol|D(</>)|sol>  =  D((f>so\)+  (dZ)/d</>)</>sol  <so1|t/|so1  > 

+  2  (d2£/d</>2)</>soi  <  sol  |  7/2  |  sol  >  -l-  ... 

Since  T  involves  D(<f>)  only  at  x  =  +  oo,  note  that  for  the  kink  system  (44) 

_(2)1/2u3  d  D 

0(4>  soi)=+— T;  ^  (Ki)  =  °i  and 

d 2E>  ,  .. 

W>soi)  =  ±  P)  ' \  as  x  ->•  Too. 

Hence  <  sol  | D(4>(x  =  ±  oo,  r))|sol  > 

(2)1/2u3 

=  ±  — — ±  j(2)1,2/i  < sol | rj2( ±  oo,  t)|sol> 


(46) 


(47) 


(48) 
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The  two-point  function  {sol|f;2(x,  t) | sol )  can  be  easily  obtained,  to  leading  order,  by 
using  the  expansion  in  terms  of  the  normal  modes  >]Jx)  in  (37). 


t]{x,  t )  = 


I 


a 


Lr(2w„)i/2 

with  K><C]  = 

This  yields  , 


"  e-iwnf.rin(x)  +  h.c. 


(49) 


< sol  | ?72 ( ±  oo,  r) | sol ) 


00  d/c  1 
0  2n  ( k 2  +  2 fi2)112 


(50) 


The  two-point  function  will  be  ultraviolet  divergent,  but  this  is  cancelled  by  the  counter 
term  (44b)  which  leads  to 


Dcjl  (</>soi  (X  =  ±  oo))  =  ^  Jd05ct ,W>)^</>soi(±  oo ) 


_  hu 

+ 


'oo 


d/c 


(2)1/2LJo  2tt(/c2  +  2^)1/2 


+  K 


(51) 


On  adding  (50)  and  (51),  one  gets  a  finite  result,  which  when  cast  in  terms  of  the 
renormalised  boson  mass  m1  yields 


i|<sol|r|sol>|  =  |i  +  ^T-  +  0(h2,  (52) 

in  agreement  with  the  quantum  soliton  mass  in  (45). 

Thus  we  see  that  for  a  (1  +  l)-dimensional  scalar  field  soliton  like  the  kink,  the 
Witten-Olive  bound  is  saturated  at  the  0(h)  quantum  level.  In  fact,  a  stronger  result 
appears  in  the  recent  work  by  Yamagishi  (1984),  where  the  Witten-Olive  inequality  as 
well  as  its  saturation  in  0(h)  quantum  theory  are  worked  out  at  the  level  of  densities — ie 
relating  the  expectation  values  of  the  Hamiltonian  density  and  the  density  of  the 
topological  index  T.  That  the  bound  holds  to  0(h)  makes  it  plausible  that  it  may  hold  to 
all  orders  in  h.  An  argument  that  it  indeed  does  so  hold,  has  been  briefly  advanced  in  the 
Imbimbo-Mukhi  work. 

Turning  to  the  (3  -I- 1)  dimensional  susy  extension  of  the  Georgi-Glashow  model  (25) 
an  evaluation  of  the  quantum  corrections  to  the  susy  monopole’s  mass  has  recently 
been  done  by  Kaul  (1984).  Although  the  underlying  principles  and  pitfalls  are  similar  to 
those  in  (1  +  1)D  models,  the  actual  calculation  is  more  difficult.  One  has  to  compute 
fluctuation  --  energies  associated  with  the  two  species  of  isotriplet  Fermi  fields  i j/*  and 
1/^2,  the  two  species  of  scalar  fields  </> \  and  (pa2 ,  the  gauge  field  A“,  and  ghost  fields,  and 
that  too  in  3  space-dimensions.  All  this  has  been  ably  done  by  Kaul  (1984).  He  finds  that, 
once  again,  the  boson  and  fermion  fluctuation  energies  do  not  cancel,  and  leave  behind 
an  ultraviolet-divergent  residue.  This  divergence,  again,  is  cancelled  by  counter  terms. 
Further,  the  resulting  answer  for  the  one-loop  monopole  mass  has  the  same  form  as  the 
classical  mass  (16b)  provided  one  replaces  the  vacuum-expectation  value  C  and  the 
coupling  constant  g  by  their  renormalised  values.  That  is 
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Turning  to  the  right  side  of  the  quantum  bound  (34),  as  applied  to  the  monopole,  it  will 
become,  after  one-loop  corrections,  47i(Cm)ren.  Assuming  that  as  required  by  the  exact 
Dirac  condition,  mren  =  l/grcn ,  the  bound  is  saturated. 

In  the  last  stages  of  his  proof  of  (53),  Kaul  uses  the  relation  gren  Cren  =  gC,  ie  that  this 
product  gC  receives  no  renormalisation  correction  in  this  model.  The  validity  of  this 
assumption  as  a  gauge — and  renormalisation-scheme-independent  statement  has 
been  questioned  in  a  very  recent  preprint  by  Imbimbo  and  Mukhi  (1984b).  This  preprint 
reached  us  just  when  we  were  completing  this  review,  and  we  have  not  had  the 
opportunity  to  digest  its  contents.  We  will  merely  report  that  these  authors  also  study 
the  quantum  mass  corrections  for  the  N  =  2  susy  monopole  (as  well  as  the  N  —  4  susy 
extension),  by  using  trace  theorems  which  are  a  generalisation  of  what  they  had 
employed  in  their  earlier  paper  quoted  above.  Their  differences  with  Kaul  notwith¬ 
standing  they  too  find  that  the  quantum  bound  is  saturated  for  the  N  =  2  susy 
monopole. 
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Special  topics  in  the  quantum  theory  of  angular  momentum 
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Abstract.  Two  special  topics  in  the  quantum  theory  of  angular  momentum  are  discussed  in 
this  article.  They  are:  (i)  the  relationship  between  the  coupling  and  recoupling  coefficients  (for 
two  and  three  angular  momenta,  respectively)  and  sets  of  generalized  hyper-geometric 
functions  of  unit  argument;  and  (ii)  the  ‘non-trivial’  or  polynomial  zeros  of  angular  momentum 
coefficients  and  their  classification. 

Keywords.  Quantum  theory;  angular  momentum;  coupling  coefficients;  recoupling  coef¬ 
ficients;  hypergeometric  functions. 

PACS  No.  02  90;  03  65;  21  90 


1.  Introduction 

The  new  developments  in  the  quantum  theory  of  angular  momentum,  during  the  past  25 
years  following  the  dramatic  discovery  of  new  symmetry  properties  of  the  angular 
momentum  coupling  and  recoupling  coefficients  (for  two  and  three  angular  momenta, 
respectively)  by  Regge  (1958,  1959)  led  to  the  subject  itself  becoming  a  part  of  the 
Encyclopedia  of  Mathematics  and  its  Applications.  Volumes  8  and  9  of  this  series,  by 
Biedenharn  and  Louck  (1981)  are  entitled,  “Angular  momentum  in  quantum  physics” 
and  “The  Racah-Wigner  algebra  in  quantum  theory”,  respectively.  In  Chapter  5,  of 
Vol.  9,  twelve  topics  are  developed  and  these  establish  the  diverse  relationships  between 
concepts  in  angular  momentum  theory  and  other  areas  of  mathematics.  The  new 
developments,  in  the  words  of  Smorodinskii  and  Shelepin  (1972)  are  “more  and  more 
intertwined  with  various  sections  of  algebra,  multidimensional  geometry,  topology, 
projective  geometry,  analytic  function  theory,  the  theory  of  special  functions,  differen¬ 
tial  equations,  combinatorial  analysis  and  the  calculus  of  finite  differences.  One  could 
say  that  the  theory  of  Clebsch-Gordan  coefficients  takes  on  the  character  of  a  new  kind 
of  calculus,  going  far  beyond  the  scope  of  the  classical  theory”. 

In  this  article,  we  devote  our  attention  to  only  two  special  topics  of  recent  origin, 
where  we  have  made  some  significant  contributions.  They  are:  (i)  the  relationship 
between  the  coupling  (Clebsch-Gordan)  and  recoupling  (Raeah)  coefficients  and  sets  of 
generalized  hypergeometric  functions  of  unit  argument,  and  (ii)  the  ‘non-trivial’  or 
polynomial  zeros  of  the  Clebsch-Gordan  (3-;)  and  Racah  (6 -j)  coefficients  and  their 
classification. 
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2.  The  relationship  between  the  Clebsch-Gordan  and  Racah  coefficients  and  genera¬ 
lized  hypergeometric  functions 


2.1  The  Clebsch-Gordan  coefficient 

Here  we  show  that  the  series  which  is  given  in  literature  (see  for  instance,  Biedenharn 
and  Louck  1981)  for  the  Clebsch-Gordan  or  3 -j  coefficient  belongs  to  a  set  of  six 
representations  and  that  there  exist,  correspondingly,  a  set  of  six  3F2(l)s,  which  is 
necessary  and  sufficient  to  account  for  the  72  symmetries  of  the  3 -j  coefficient. 

The  3 -j  coefficient  is  defined  by: 


where 


/  7i  72  73 
\ml  m2  m3 


5(ml  +  m2  +  m3)(-l)i'  h  ;3) 


x  n  [C/i+mi)-  0.  —  mi)!]1/2 

/=  l 


X  £(-!)' 


t!  n  (f  —  at)!  n  (A-0! 

k  =  1  1=1 


- 1 


maxfa^aj)  sS  t  minO?„  p2,  P3), 

Pi  =  ji  ~  mt ,  P2  =  k  +  m2,  p3  =  jt  +j2  ~ j 3, 

<*i  =ji~ji  +  m1  =h  -">i  -O3  +  M3X 

“2  =  h  ~h  ~  mi  =  h  +  m2  -  U3  ~ ">3), 


and  A(x^z)  =  [_(x  +  y  —  z)\  ( x  —  y  +  z)\  {  —  x  +  y  +  z)\ / (x  +  y  +  z  +  1)!]1/2.  Equation  (1)  is 
invariant  to  permutations  of  the  three  angular  momenta  (column  permutations)  and 
to  space  reflections  (ml ■ -*  —m^  thereby  exhibiting  a  12-element  symmetry  group. 
Regge  (1958)  arranged  the  nine  integer  parameters  referred  to  by  Racah  (1942),  into  a  3 
x  3  square  symbol  and  represented  the  3 -j  coefficient  as: 


/  7i  72  73  \ 
Vmiw  2  m3/ 


~7i  +7*2  +7*3 
7i  -Wi 
7i  +  mi 


7i  72+73 
7*2  -  m2 
7*2  +  ^2 


7i  + 72  73 
7*3  ~m3 
7*3  + 


The  fact  that  all  sums  of  columns  and  rows  are  equal  to  J  —  jY  +  j2  +  j3  leads  to  the 
following  nine  relations  amongst  the  elements  of 

Rlp  +  Rmp  =  Rnq  +  Rnr’  (3) 


for  cyclic  permutations  of  both  (Imn)  =  (123)  and  (pqr )  =  (123).  Regge  (1958)  dis¬ 
covered  that  there  exists  a  72-element  symmetry  group,  comprising  the  aforementioned 
12-element  symmetry  group,  for  the  3 -j  coefficient,  due  to  the  invariance  of  ||Rik||  to  its 
column  and  row  permutations  and  a  reflection  about  its  diagonal. 

Racah  (1942)  pointed  out  that  substitution  for  the  summation  index  t,  the  argument 
of  one  of  the  five  factorials  in  (1)  only  leads  to  some  symmetry  properties  of  the  angular 
momentum  coupling  coefficient.  It  is  straightforward  to  show  that  the  five  substitution 
procedures  lead  to  five  series  representations.  We  show  here  that  these  together  with  (1) 
constitute  a  set  of  six  series  representations  for  the  3 -j  coefficient,  which  is  necessary  and 
sufficient  to  account  for  its  72  symmetries.  Since  the  72-element  symmetry  group  is 
evident  when  the  3 -j  coefficient  is  represented  by  ||Rik||,  it  is  worthwhile  to  note  that  the 
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set  of  six  series  representations  can  be  written  in  the  following  compact  notation: 
\\Rik\\  =  <3(mj  +m2  +  m3)  n  [Rik\/{J  +  1)]1/2 

i,  k  =  1 

X  (  -  lr^I [s!  (R2p -s)'.(R3q -s)\(Ru-s)\ 

S  / 

x  (S  +  R3r  - Rlp)\ (S  +  R2r  -R3q)\rl  (4) 

for  all  permutations  of  (pqr )  =  (123),  with 

^  _  j  R3p-R2q  for  even  permutations  of  (123) 

^  {  R3p  —  R2q  +  J  for  odd  permutations  of  (123). 

Let  the  six  series  representations  be  denoted  by  the  Roman  numerals  (I)  to  (VI), 
corresponding  to  the  six  permutations  of  {pqr)  in  (4):  (123),  (231),  (312),  (132),  (321)  and 
(213),  respectively.  This  set  of  six  series  representations  can  also  be  obtained  by 
permuting  the  indices  (123)  in  (1),  and  remembering  that  the  series  acquires  an 
additional  phase  factor  of  (  —  l)*7,  for  odd  permutations.  In  establishing  the  one-to-one 
correspondence  between  the  series  obtained  by  the  substitution  procedures  and  that 
obtained  by  permuting  the  indices  in  (1),  use  is  made  of  the  fact  that  4/,  is  an  even  integer 
so  that  ( —  l)4*  is  always  positive.  The  following  observations  can  now  be  made,  with  the 
help  of  (3)  where  necessary: 

(a)  the  column  permutations  of  \\Rik\\  are  in  one-to-one  correspondence  with  the  six 
series  representations,  thereby  spanning  the  whole  set  given  by  (4); 

(b)  the  permutation  of  the  second  and  third  rows  of  ||Rik  ||  which  corresponds  to  space 
reflection,  exchanges  (I)<-*(VI),  (II)  ^(V)  and  (III)  (IV); 

(c)  the  permutation  of  the  first  and  second  rows  of  ||Rik||  exchanges  (I)+->(V), 
(II)^(IV)  and  (III)  —  (VI); 

(d)  the  permutation  of  the  first  and  third  rows  of  ||Rik||  exchanges  (I)<->(IV), 
(II)«-*(VI)  and  (III)<-»(V); 

(e)  the  cyclic  permutation  of  rows,  (123)  — ►  (231),  of  ||Rik||  permutes  anti-cyclically  the 
first  three  and  the  last  three  of  the  six  series  representations,  amongst  themselves,  as:  (I) 

-  (HI)  -  (II)  -  (I)  and  (IV)  ->  (VI)  -  (V)  -  (IV); 

(f)  the  cyclic  permutation  of  rows,  (123)  -» (312),  of  ||Rfk  ||  permutes  cyclically  the  first 
three  and  the  last  three  of  the  six  series  representations,  amongst  themselves,  as:  (I) 

-  (II)  ->  (HI)  -  (I)  and  (IV)  -  (V)  -+  (VI)  ->  (IV); 

(g)  the  reflection  of  || Rik  ||  about  its  diagonal  interchanges  (I)  <-►  (III)  leaving  the  other 
four  series  representations  invariant; 

(h)  the  reflection  of  \\Rik  ||  about  its  skew-diagonal  interchanges  (IV)  <-►  (V)  leaving  the 
others  invariant; 

(i)  each  series  representation  is  left  invariant  to  the  combined  operation  of  an  odd 
column  permutation  and  the  space  reflection;  as  well  as  to  either  the  Regge  symmetries 
(Appendix  1)  as  such  or  Regge  symmetries  on  which  are  superposed  an  even  column 
permutation  or  an  odd  column  permutation  and  the  space  reflection; 
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(j)  each  series  representation  exhibits  12  of  the  72  distinctly  different  symmetries  of 
the  3 -j  coefficient  and  this  12-element  symmetry  group  is  isomorphic  to  the  product  of 
permutation  groups  of  3  objects  ( Rlp ,  R3q,  Rlr)  and  2  objects  (R3r  —  R2p,  R2r  —  R3q). 

From  the  observations  (a)  to  (j)  it  follows  that  the  set  of  six  series  representations  is 
necessary  and  sufficient  to  account  for  the  72  known  symmetries  of  the  3 -j  coefficient. 

Conventional  ways  (refer,  for  instance,  deShalit  and  Talmi  1963)  of  establishing  the 
symmetry  relations  are  tedious  and  some  (Rose  1955)  necessarily  resort  to  the 
substitution  procedures.  Our  observation  that  ||Rlk||  is  in  fact  a  set  of  six  series 
representations  enables  one  to  establish  the  symmetries  easily.  Notice  that  the  first  three 
series  corresponding  to  even  column  permutations  are  separated  from  the  last  three 
series  which  correspond  to  odd  column  permutations,  in  the  sense  that  the  latter  contain 
the  additional  phase  factor,  ( —  1)J. 

The  series  representations  can  be  rearranged  into  generalized  hypergeometric 
functions  (Slater  1966)  of  unit  argument  by  using: 


n!  =  r(n  +  l)  (5) 

for  the  factorials  and  whenever  the  summation  index  in  the  argument  of  the  T  function 
is  negative  resorting  to: 

r(n)T(l  —n)  —  n  cosec  nn  (6) 


to  replace  it  by  a  T-function  with  an  argument  containing  a  positive  index  of 
summation.  This  procedure  of  rearrangement  yields  in  the  case  of  the  3 -j  coefficient 
(Srinivasa  Rao  1978;  Venkatesh  1978)  the  following  set  of  six  3/r2(l)s: 


HR*  ||  =  <5(w,  +m2  +  m3) 


3 


n  **!/(•/+!) 

i,k  =  1 


1/2 


x  (  —  l  [  r  ( 1  —  A,  1  -B,  1  -C,D,  £)]-* 


x  3F2  (ABC;  DE;  1),  (7) 

where 

A=  —R2p,  B  —  —  R3q,C  =  —  Rir,  Z)  =  1  +  R3r  —  R2p, 

E  =  l+R2r-R3q  and  T(x,y, .  .  .)  =  r(x)T(y) 

for  all  permutations  of  (pqr)  =  (123).  This  set  of  six  3F2(l)s  given  here  are  all  of  the  van 
der  Waerden’s  form  (Smorodinskii  and  Shelepin  1972) — the  3F2(1)  given  by  (5*21)  of 
this  reference  corresponds  to  (pqr)  =  (123). 


2.2  The  Racah  coefficient 

Here  we  show  that  it  is  possible  to  obtain  two  equivalent  sets  of  series  representations 
for  the  Racah  (6 -j)  coefficient,  which  can  be  rearranged  into  two  equivalent  sets  of 
4F3(1)s.  These  sets  are  shown  to  be  related  to  each  other  through  the  property  of 
reversal  of  series  of  the  generalized  Saalschutzian  hypergeometric  function. 

The  conventional  expression  for  the  Racah  coefficient  (Biedenharn  and  Louck  1981) 
is  given  by: 


2^J  =  (-l  )a+b+c+dW(abcd;ef) 
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=  NX(-l)p(P+1)! 

p 

where  the  range  of  P  is 

Pmin  ^  P  ^  Pmax> 

with  Pmin  =  max(oc1,  0(2,  ^3,  £4), 

Pmax  =  min  (pit  p2,  p3), 
a^a  +  b  +  e,  a  2  =  c  +  d  +  e,  a3  =  a  +  c+/,  a4  =  b  +  d+/, 

Pi^a  +  b  +  c  +  d,  p2  =  a  +  d  +  e+f  /?3  =  6  +  c-l-e-l-/, 

N  —  A  (abe)  A  (cde)  A  (ac/)  A  (bdf), 

and  A(xyz)  defined  as  before  (below  equation  (1)). 

The  6 -7  coefficient  is  invariant  under: 


ncp-a*)!  YUPj~pV 


i  =  1 


j=  1 


- 1 


(8) 


(i)  the  3!  (  =  6)  column  permutations  and 

(ii)  the  four  interchanges  of  any  two  elements  in  the  first  row  of  the  6-7  symbol  with 
the  corresponding  elements  in  the  second  row  (this  will  be  referred  to  as  ‘row’ 
permutation  in  the  text).  These  24  symmetries  constitute  the  classical  tetrahedral 
symmetry  group  of  the  6 -7  coefficient. 

In  passing  it  is  to  be  noted  that  this  series  form  (8)  can  be  found  for  the  first  time  in 
Regge’s  (1958)  article.  Racah  (1942)  dealt  with  only  the  series  expansion  obtained  by 
substituting  s  =  pl—P  in  (8),  which  corresponds  to  one  of  a  set  of  three  represen¬ 
tations,  to  be  discussed  later. 

Once  the  series  is  in  the  form  (8),  it  is  easy  to  see  that  the  6 -j  coefficient  exhibits  the 
144-element  symmetry  group,  due  to  its  invariance  under  the  permutation  of  the  four 
a’s  and  the  three  /T s.  It  was  Regge  (1958)  who  dramatically  discovered  six  more 
symmetries  (Appendix  2)  and  established  that  the  Racah  (6-7)  coefficient  exhibits  144 
symmetries  and  not  only  the  24  tetrahedral  symmetries. 

By  setting  in  (8),  s  =  pk  —  P,k  =  1,  2,  3,  in  succession,  we  get  the  set  I  of  three  series 
expansions: 


Ni-iy^K-im-s+D! 


x 


ri  (&-«,-*)!  n  (s+pi-w 

1=1  ;=i 


-1 


Notice  that  a  series  belonging  to  this  set  I  exhibits  only  48  of  the  144  symmetries,  due  to 
the  permutation  of  all  the  four  a’s  but  only  two  of  three  /Ts,  since  is  now  in  the 
numerator  in  (9). 

Instead,  if  we  set  in  (8),  s  =  P  —  oih  l  =  1,  2,  3,  4,  in  succession,  we  obtain  (Srinivasa 
Rao  and  Venkatesh  1977)  the  following  set  II  of  four  series  representations: 


X 


n  (s +  “(-«;)!  n  (Pj~ai  +  s)! 

1=1  j- 1 


-1 


(10) 


where  we  notice  that  a  series  belonging  to  set  II  (10)  exhibits  only  36  of  the  144 


20 


K  Srinivasa  Rao 


symmetries,  arising  due  to  the  permutation  of  all  the  three  /Ts,  but  only  three  of  the  four 
a’s  since  a*  is  now  in  the  numerator  in  (10). 

Thus,  the  144  symmetries  of  the  Racah  (6 -j)  coefficient  are  exhibited  by  the  single 
series  expansion  (8),  or  by  the  set  I  of  three  series  representations  given  by  (9),  or, 
equivalently,  by  the  set  II  of  four  series  representations  given  by  (10). 

When  (9)  is  rewritten  in  terms  of  generalized  hypergeometric  functions  of  unit 
argument,  we  get  (Srinivasa  Rao  et  al  1975),  the  set  I  of  three  generalized  Saalschutzian 
hypergeometric  functions  of  unit  argument: 

=  (-l)£+1ivr(l  -£)[T(  1  —  A,  1  - B ,  1  -C,  1  -A  F,  G)]'1 

x  4F3(ABCD;  EFG ;  1),  (11) 

\  , 

where 

A  —  e—  a—  b,  B  =  e  —c  —d,  C  —  f—  a—c,D=  f—  b  —d, 

E  —  —a  —  b—c—d  —  1,  F  =  e+f—a  —d  +  1,  G  =  e+f—b  —c  +  1 

and  T(xyz  ....)  =  r(x)T(>’)r(z)  ....  Superposing  the  column  permutations  of 
{dc}}  on  the  4F3(1)  in  (11),  yields  us  the  set  I  of  three  4F3(l)s.  We  note  that  the 
superposition  of  a  ‘row’  permutation  of  {Sc/}  on  the  4F3(1)  in  (11)  results  only  in  a 
permutation  of  the  numerator  and  denominator  parameters  amongst  themselves  in  a 
given  4F3(1)  belong  to  this  set  I. 

When  (10)  is  rearranged  into  a  set  of  hypergeometric  functions,  we  obtain  (Srinivasa 
Rao  and  Venkatesh  1977)  the  following  set  II  of  four  4F3(l)s  for  the  6-j  coefficient: 

^dbcef\  =  (-\y-2Nr(A')[T(\-B',\-C,\-D',E',F',G')Yl 

x  4F3(A'B'CD';  E'F'G';  1),  (12) 

where 

A'  =  a  +  c  +  f  +2,  B'  —  c—  d—  e,  C  —  a  —  b—  e,  D'  =  /—  b—d , 

E'  —  a  *+■  c  —  b  —  d  -f-  1,  F'  —  a  -\-f  —  d  —  cFl,  G  =  c  -\-f  —  b  —  6  1. 

The  set  of  four  4F3  (l)s  is  spanned  by  superposing  the  interchange  of ‘row’  permutations 
of  {Sc/}  on  the  given  4F3  (1)  in  (12).  Superposing  the  column  permutations  of  the  6-j 
coefficient  on  a  given  4F3(1)  belonging  to  this  set  results  only  in  a  permutation  of  the 
numerator  and  denominator  parameters  amongst  themselves. 

Since  more  than  one  numerator  parameter  in  the  set  I  and  set  II  of  4F3(l)s  is  a 
negative  integer,  we  can  generalize  the  property  of ‘reversal  of  the  series’  given  by  Bailey 
(1935)  for  the  case  of  a  3F2(1),  to  the  case  of  a  Saalschutzian  4F3(1)  to  obtain  the 
identity: 

4F3 (ABCD;  EFG ;  1)  =  (-1)DT(1  -A,  l-B,  1  -C,  F,  G,  D  — F+l) 
x  [F{D  —  A+  1,  D  —  B+l,  D-C+l,  F  — D,  G  — D,  1  — F)]-1 
x4F3(Z)-F+1,  D— F+l,  D-G+l,  D;  D-A+  1,  D  —  B+  1,  D  — C+  1;  1),  (13) 

where  D  is  the  negative  parameter  which  determines  the  number  of  terms  in  the  series. 
Using  (13)  in  (11),  after  some  simple  algebraic  manipulations,  we  get: 

=  (-1  )A'NT(A%r(F\  G',  E\  1  -C,l  —B\  1  —  D')]_1 


x  4F3(/}'C'B'D';  F  G  F';  1), 


(12') 
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which  is  nothing  but  the  4F3(1)  belonging  to  set  II  given  by  (12)  except  for  a  trivial 
permutation  amongst  the  numerator  and  denominator  parameters  of  the  generalized 
hypergeometric  function. 

Thus,  we  find  that  the  set  I  and  set  II  of  4F3(l)s  for  the  6 -j  coefficient  are  not 
independent  but  are  related  to  one  another  by  the  property  of  reversal  of  series.  It  is 
intriguing  to  note  that  due  to  reversal  of  series  property  the  set  I  of  three  4F3(l)s  yields 
the  set  II  of  four  4F3(l)s  and  vice  versa. 

It  has  been  possible  for  us  (Srinivasa  Rao  and  Venkatesh  1978;  Srinivasa  Rao  198 1)  to 
show  the  advantages  of  using  the  sets  of  pFq(l)s  in  the  numerical  computation  of  the  3  -j 
and  the  6 -j  coefficients.  The  pFq(l)s  can  be  computed  numerically  using  Horner’s  rule 
(Lee  1966)  for  polynomial  evaluation  as: 


where 


pf,(« i<*2  •••<*„;  P1P2  •  ••£,; z) 


1  + 


■ p 5  t-  ir  z  '  ’  ‘  t'q4 

X0  f  X\(  X' 


>0 


Z  + 


y  1 


z  + 


>2 


z  + 


*/ 


=  fl  (aj +  0  and  >’£  =  0  + 1 )  fl  (Pk  +  0- 


j=  1 


k=  1 


(14) 


The  use  of  (14)  places  our  approach  of  numerical  computation  on  the  same  footing  as 
that  of  Wills  (1971)  and  in  the  nested  form  the  number  of  multiplications  is  minimum. 
In  fact,  the  expression  of  Bretz  (1976)  is  identical  to  the  set  II  of  4F3(l)s  given  by  (12). 
Thus,  the  set  of  pFq(\)s  which  have  been  shown  to  be  necessary  and  sufficient  to  account 
for  the  symmetries  of  the  3 -7  and  the  6 -7  coefficient  have  been  found  to  be  not  only 
useful  but  also  time-saving  when  compared  to  the  best  available  computer  (Fortran) 
programs  of  Wills  (1971)  and  Bretz  (1976)  by  about  5  to  15%. 

D’Adda  et  al  (1974)  and  Raynal  (1978,  1979)  make  use  of  Whipple’s  (1925,  1936) 
work  on  the  symmetries  of  3F2  functions,  well-poised  7F6  functions  and  Saalschutzian 
4F3  functions,  all  with  unit  argument,  to  study  the  3 -7  and  the  6 -7  symbols  generalized 
to  any  arguments.  Raynal  (1978)  generalized  one  of  the  formulas  for  the  3 -j  symbol  of 
SU  (2)  to  obtain  a  generalized  3F2(1)  with  complex  parameters.  Biedenharn  and  Louck 
(1981),  discuss  this  aspect  of  the  Racah-Wigner  algebra — viz  the  relationship  of  the  3 -j 
and  the  6-j  coefficients  to  generalized  hypergeometric  functions — as  a  special  topic  in 
Vol.  9  of  the  Encyclopedia  in  Mathematics  and  its  Applications  and  summarize  the 
results  of  our  work  presented  here  in  a  different  notation. 


3.  ‘Non-trivial’  or  polynomial  zeros  of  angular  momentum  coefficients  and  their 
classification 

Here  we  show  that  the  polynomial  or  ‘non-trivial’  zeros  of  the  Clebsch-Gordan  (3 -j) 
and  the  Racah  (6-7 )  coefficients  can  be  classified  according  to  their  degree.  The  majority 
of  the  zeros  tabulated  to-date,  hitherto  considered  as  non-trivial,  are  in  fact  polynomial 
zeros  of  degree  one,  arising  due  to  the  existence  of  binomial  expansions  for  these 
coefficients. 

3.1  Clebsch-Gordan  coefficient 

Trivial  zeros  of  the  3 -7  coefficient  are  those  which  arise  due  to  either  the  triangular 
inequality  not  being  satisfied  by  jlJ2  andy3;  or  due  to  J  (  =  jl  +  7 2  +73)  being  odd  when 
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mx  =  m2  =  m3  =  0  (which  can  be  easily  verified  as  the  consequence  of  the  symmetry 
property  of  the  3 -j  coefficient  under  spatial  reflection:  m,  -►  —  m,).  The  ‘non-trivial’ 
zeros  are  identified  as  the  zeros  of  the  polynomial  part  of  the  3 -j  coefficient,  defined  in 
(4).  They  are  also  called  as  “structural”  zeros. 

Sato  and  Kaguei  (1972),  used  the  concept  of  generalized  powers  to  obtain  a  formal 
binomial  expansion  for  the  3 -j  coefficient.  Following  the  procedure  given  by  Sato 
(1955)  and  using  the  symbolic  notation  for  generalized  powers: 

P(x)  =  P!/(P  —  x)!,  (15) 

we  can  rearrange  the  set  of  six  series  representations  given  by  (4)  into  the  form: 


R* II «  * (m,  +  m2  +  I»3)  n  [R« !/(■/+  1)]1/2  ( - 1  rpqr) 


i,k 


x  [T(n-f  1,  CM  +  1,  Cv+  1,  Brp  +  n  +  1,  Brq  +  n  +  1)] 
x  { (Brp  +  n)  ( Brq  +  n)  -  Cu  Cv  } (n), 


-l 


(16) 


where  n  —  min (R2p,  3q,  Rlr ),  Cu,  Cv  represent  the  Rik s  in  the  triple  ( Rlp ,  R3q ,  Rlr)  other 
than  the  minimum,  Brp  —  R3r  —  R2p  and  Brq  =  R2r  —  The  expression  (16)  is  a 
generalization  of  the  formal  binomial  expansion  obtained  by  Sato  and  Kaguei  (1972), 
for  their  result  can  be  obtained  by  putting  (pqr)  —  (231)  in  (16),  while  (16)  itself  holds 
for  all  the  six  permutations  of  (pqr)  —  (123). 

Obviously,  since  the  generalized  power  (15)  is  exact  for  n  =  1,  (since  P(l)  =  P)  the 
binomial  form  for  the  3 -j  coefficient  explicitly  reveals  further  structure  zeros  of  this 
coefficient.  These  can  be  shown  (Srinivasa  Rao  and  Rajeswari  1984a)  to  be  polynomial 
zeros  of  degree  one.  We  can  simply  locate  all  the  polynomial  zeros  of  degree  one  of  the 
3 -j  coefficient,  independent  of  the  numerical  values  of  the  arguments  of  the  3 -j 
coefficient,  with  the  help  of  the  simple  factor: 

iX-Ky  ^.i)’  <17> 


.  where  x  and  y  are  given  by: 


x  =  Rmr  Rkp  and  y  =  Rmp  R^, 

with  (Imk)  and  (pqr)  corresponding  to  specific  permutations  of  (123)  when  n  =  Rlq. 

Polynomial  zeros  of  the  3 -j  coefficient  were  listed,  for  J  (  =  jx  +72+73)  ^  27  by 
Varshalovich  et  al  (1975).  Bowick  (1976)  used  the  Regge  symmetries  for  the  3-7  and  the 
6-7  coefficients  to  obtain  listings  of  only  the  Regge-inequivalent  polynomial  zeros.  The 
number  of  terms  in  the  polynomial  part  of  the  3-j  coefficient  is  governed  by:  n 
=  min (R2p,  B3q,  Rir)  an<^  we  use  this  simple  prescription  to  classify  the  polynomial 
zeros  tabulated  by  Bowick  (1976)  and  by  Biedenharn  and  Louck  (1981)  into  those 
which  correspond  to  degree  1,  2  and  4.  The  classified  tables  can  be  found  in  Srinivasa 
Rao  and  Rajeswari  (1984b).  From  the  tables,  we  conclude  that  for  J  ^  27,  of  the  36 
polynomial  zeros,  21  are  polynomial  zeros  of  degree  1,  revealed  by  the  exact  binomial 
expansion  for  this  coefficient,  represented  by  the  multiplicative  factor  (17). 


3.2  The  Racah  coefficient 

Trivial  zeros  of  the  Racah  coefficient  are  those  which  arise  due  to  a  violation  of  one  of 
the  four  triangular  inequalities  to  be  satisfied  by  the  six  angular  momentum  arguments. 
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The  existence  of  a  class  of  zeros  of  the  6  -j  coefficient,  not  due  to  the  aforesaid  reason  has 
been  called  as  polynomial,  ‘non-triviaf  or  structure  zeros  by  Koozekanani  and 
Biedenharn  (1974). 

Sato  (1955)  expressed  the  Racah  coefficient  symbolically  as  a  binomial  expansion. 
Following  his  notation  but  making  a  different  set  of  substitutions,  corresponding  to 
those  required  for  getting  the  set  I  of  series  expansions  given  by  (9),  given  below: 

s  =  P0-P,  0  ^  s  ^  n,  n  =  /?0  -  a0, 

Ai  =  £o  ~  O'  =  P,  q,  r\  Bj  =  pj  -  p0  ( j  =  u,  v\  (18) 

where  the  indices  p,  q ,  r  and  w,  v  are  used  for  those  values  of  the  a’s  and  P's  other  than 
Fjnin  =  max  (al9  a2,  a3>  a4)  =  a0  and  Pmax  =  min(j Slip2,  P3)  =  P0,  we  obtain  (Srinivasa 
Rao  and  Venkatesh  1977)  the  binomial  expansion: 

=  N  (  —  l)^0[T(n  +  1,  Ap+  1,  Aq  +  l,  Ar+  1,  Bu  +  n+  1,  B^-fn-fl)]  1 

x  { (Bu  +  n)  (Bv  +  n)  — A pAqAr  ( +  1 )( ~ 1  > } (n)  (19) 

where  we  used  the  notation: 

P(X)  =  pj/(p_x)t  and  P("x)  =  1/PW 


and  regarded  the  form  P(x)  to  represent  the  generalized  powers  of  P.  It  follows  that  (19)  is 
an  exact  binomial  expansion  for  the  power  n  =  1.  The  structure  zeros  which  arise  from 
the  polynomial  part  of  the  6-j  coefficient  are  indeed  polynomial  zeros  of  degree  one.  All 
these  polynomial  zeros  of  degree  one  are  accounted  for  by  the  simple  factor: 


(1  -KyKi),  (20) 

where 

X  =  (£u  +  n)(Bv  +  n)  (&>  +  !), 

=  (Pu  ~  ao)  (Pv  ~  ao)  (Po  +  1)» 

and 

Y=ApAqAr, 

=  (Po  -  ap)  (Po  -  (Po  -  ar)-  (21) 


Koozekanani  and  Biedenharn  (1974)  calculated  the  6-j  coefficient 


7  i  7  2  7  3  1 

h  1 2  h  J 


for 


arguments  ;■>  h  =£  18*5  for  i=l,  2,  3  and  found  its  polynomial  zeros.  Using  the 
symmetries  of  the  6-j  coefficient,  they  ordered  the  arguments  j1J2J3,  h,  l2>  h, in  a 
speedometric  fashion  with  jx  the  slowest  varying  and  l3  the  most  rapidly  changing 
variable.  The  vanishing  values  of  the  6-j  coefficient  were  calculated  by  a  computer 
program  which  resorted  to  the  use  of  numbers  decomposed  into  powers  of  prime 
factors.  Bo  wick  (1976)  used  the  Regge  symmetries  (Appendix  2)  of  the  6-j  coefficient  to 
obtain  a  reduced  listing  of  the  Regge-inequivalent  polynomial  zeros  for  arguments 
^  18-5. 

On  the  other  hand,  we  have  used  a  simple  program  which  checks  for  the 
multiplicative  factor  (20)  being  equal  to  zero,  to  find  all  the  polynomial  zeros  of  degree 
one,  for  jh  /,  ^  18*5  for  i  =  1,  2, 3.  Having  separated  the  majority  of  the  zeros,  we  sorted 
out  the  remaining  polynomial  zeros  tabulated  by  Koozekanani  and  Biedenharn  (1974), 
according  to  their  degree  given  by  n  =  P0-a 0.  These  tables  can  be  found  in  Srinivasa 
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Rao  and  Rajeswari  (1984b).  From  our  classified  tables  we  conclude  that  1174  out  of 
1420  polynomial  zeros  tabulated  by  Koozekanani  and  Biedenharn  (1974)  are 
polynomial  zeros  of  degree  one,  revealed  by  (20)  which  is  a  consequence  of  the  exact 
binomial  expansion  (19). 

We  have  also  shown  (Srinivasa  Rao  1985)  that  of  the  12  generic  or  Regge- 
inequivalent  zeros  which  have  been  explained  to  be  due  to  either  triangle  rule  violation 
for  quasi-spin  (Koozekanani  and  Biedenharn  1974),  or  vanishing  of  fractional 
parentage  coefficients  in  the  atomic  g-shell  (Judd  1970),  or  realizations  of  exceptional 
Lie  algebras  G2,  F4  and  Ee  (Van  der  Jeugt  et  al  1983;  Van  den  Berghe  et  al  1984) — eleven 
are  trivial  polynomial  zeros  of  degree  one.  We  also  conjectured  (Srinivasa  Rao  1984) 
that,  in  principle,  one  can  find  closed  form  formulas  for  the  polynomial  zeros  of  the  3 -j 
and  the  6-j  coefficients,  provided  we  look  upon  these  coefficients  as  generalized 
hypergeometric  functions  of  unit  argument,  which  are  analytic,  and  extend  the  method 
of  Siewert  and  Burniston  (1972)  to  determine  zeros  of  analytic  functions  to  the  case  of 
analytic  pFq(l)s. 
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Appendix  1 

Regge  (1958)  symmetries  for  the  3 -j  coefficient,  written  down  explicitly  (Srinivasa  Rao 
1978)  are: 

/ A  A  A  \  / A  l(A+A  +  »*i)  iUi+h-™ i)  \ 

I  m,  m2miJ  \j2-j3  2  (A  ~A  +  m1)  +  m2  i  (A -j2  +  mj)  +  m3  / 

=  fiUi  +j3  +  m2) 

V2O3 -ji+m2)  +  mi 

=  /4(A  +ii~m  3) 

\iUi -j2  +  mi)  +  mi 

=  { \(h+j2~m3 ) 

\j3-iU1  +ji+m3) 

=  ( ?Ul+j2+m3 ) 

\iUi+j2-m3)-j3 

Only  the  first  and  fourth  of  these  symmetries  are  essentially  new,  the  others  can  be 
obtained  from  these  and  the  column  permutations  and  space  reflection  symmetries.  For 
an  interpretation  of  the  symmetry  of  the  Clebsch-Gordan  coefficients  discovered  by 
Regge,  refer  Bincer  (1970) 


h  2U1  +j3-m2)  \ 

ji  ~j3  2  ( h  ~h  +m2)  +  m3) 

l(ji+j2  +  m3)  j3  \ 

UJi  -j2  +  m3)  +  m2  j2-j1) 

l(j2+j3~mi)  2  (A  +j3-m2)  \ 

ji  —  2  (A  +A  +  mi)  J2  ~  2  Ui  +A  +  m2)  J 

i(j2+j3  +  ml)  2  (Ji  +A  +  m2)  \ 

2  Ut  +h  ~mi )-ji  2  (A  +A  ~ m2 )  -j2  ) ' 
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Appendix  2 


The  Regge  (1959)  symmetries  for  the  Racah  coefficient  are: 


abe\  =  [a  iib  +  c  +  e-f)  \(b  —c  +  e  +/)  j 
dcf  j  \b  {(b  +  c-e+f)  ^(-6  +  c  +  c+/)J 


2  (a  —  d  +  c  +/) 

2  ( — + 


b  \{a  +  d  +  e  — /)  1 
c  i(a  +  ^-c+/)j 


2  (fl  ~E  h  4-  c  —  d) 
a-\-b-hc-\-d) 


\(a  +  b  —c  +  d)  cl 
{  (a  —  b  +  c  +  d)  /  j 


2(^  +  c  +  c— /)  2  (a  ~ ^  +  £+/) 

^(h  +  c—  e+f)  \  (  —  a  -f  d  +  e  -f/) 

\  (b  —  c  +  c  +/)  ^(a-l-d  +  c— /) 

2  ( —  ^  +  c -f  c +/)  +  d  —  e +/) 


^(a  +  /?-c  +  i/)l 
2(6!  —  6  +  c  +  J)j 

^(fl  +  6  +  c — d)  1 
2  ( —  a-{-b-{-c-yd)j 
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Modified  perturbation  series  for  the  anharmonic  oscillator  using 
linearization  technique 
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Abstract.  The  linearization  technique  of  random  phase  approximation  is  applied  to  the 
anharmonic  oscillator  to  find  a  modified  perturbation  series.  It  is  shown  that  for  the 
anharmonic  term  XxA,  the  ground  state  energy  £0  upto  the  second  order  of  perturbation  is 
given  by  E0  =  (35/48)  (3/4)1/3  A1/3  as  X  -  oo. 

Keywords.  Perturbation  series;  anharmonic  oscillator;  random  phase  approximation. 
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1.  Introduction 

From  the  very  beginning  of  quantum  mechanics  it  has  been  known  that  there  are  very 
few  eigenvalue  problems  which  can  be  exactly  solved  analytically.  For  physical 
problems  in  reality  one  can  either  find  the  eigenfunctions  and  eigenvalues  numerically 
or  resort  to  some  kind  of  perturbation  treatment  (Thouless  1972).  In  the  straightfor¬ 
ward  application  of  perturbation  theory  to  many  problems  it  is  found  that  successive 
terms  in  perturbation  expansion  do  not  converge,  and  hence,  various  modified  forms  of 
expansion  have  been  proposed  from  time  to  time.  In  the  present  work  we  shall  use  the 
linearization  technique  of  Random  Phase  Approximation  (rpa)  (Rowe  1970)  to  find  a 
modified  perturbation  series.  Extensive  literature  on  the  treatment  of  anharmonic 
oscillators  (Hioe  and  Montroll  1975;  Singh  et  a\  1978)  being  available,  we  have  chosen 
the  anharmonic  oscillator  problem  for  comparison  of  the  modified  perturbation  series 
obtained  using  the  linearization  technique. 

We  describe  the  formulation  in  §  2  and  present  the  conclusions  in  §  3. 


2.  Formulation 


Let  us  consider  a  hamiltonian  H  given  by 


H  = 


2  dx2 


1  2  , 
+  -  x2  +  Xx 
2 


4 

5 


where  /be4,  X  >  0,  is  the  anharmonic  perturbation.  Applying  the  ordinary  perturbation 
theory  treating  the  anharmonic  term  Xx4  as  perturbation,  the  series  start  diverging  with 
the  increase  in  the  coupling  constant  X. 

The  essential  idea  of  the  linearization  technique  is  to  approximate  the  term  x4  as 
<  x2  >  x2,  where  <  x2  )  is  the  expectation  value  of  x2  with  respect  to  some  vacuum  state. 
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The  perturbation  expansion  is  then  applied  to  the  left-over  term.  We  therefore  split  H  in 
the  following  way: 


where 


and 


H  =  H0  +  Hlt 


1  d2  1 

2  dx2  "**  2 


1+2/?  be 


H i  =  Xx*-px2, 


(2) 

(3) 

(4) 


where  /?  is  an  unknown  constant  to  be  determined. 

The  eigenfunctions  of  H0  can  immediately  be  written  as 


b  —  V 1  +  2  /?, 


(5a) 

(5b) 

(5c) 


The  next  problem  is  to  determine  the  unknown  constant  /?.  From  our  knowledge  of 
manybody  physics  (Nesbet  1955)  we  know  that  the  largest  configuration  interaction 
effects  arise  from  the  lowest  configurations.  We,  therefore,  determine  /?  by  postulating 
that  the  matrix  element  of  between  ij/0  and  i j/2  must  vanish.  This  gives  the  following 
cubic  equation  to  determine  /?: 

p2  (1  +  2p)  —  9X2  =  0.  (6) 

Simple  calculations  now  give  the  contributions  to  the  ground  state  energy  E0  in 
various  orders  of  perturbation.  We  present  these  results  in  table  1,  where  the  results 
obtained  using  ordinary  perturbation  theory  are  also  shown. 

We  calculate  the  ground  state  energy  for  k  —  1,  which  gives  us  an  idea  about  the 
accuracy  of  the  present  formulation.  We  find  from  (6)  that  for  this  case  ft  =  3/2.  From 
table  1,  we  get  the  value  of  E0  upto  the  second  order  of  perturbation  to  be  E0  —  0801. 
The  value  of  E0  as  given  by  Hioe  and  Montroll  (1975)  using  the  Bargman 
representation  is  E0  =  0-804.  Thus  our  E0  has  an  error  of  0-4%,  which  is  quite  small. 

The  other  interesting  case  is  the  one  in  which  the  coupling  constant  X  becomes  very 


Table  1.  Contributions  of  various  orders  of  perturbation  £(0n)  to  ground  state 
energy. 


Contribution  to 
ground  state  energy 

n01 

E'o" 

EX' 

ordinary" 

h 

f  A 

-21/8  (yl2) 

modified* 

—  3A/4  (1  +2fi) 

—  |A2/(1  +2/f)s/2 

fl  ordinary:  from  ordinary  perturbation  theory;  b  modified:  by  use  of  the  present 
formulation 
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large.  For  this  case  (6)  gives  the  following  value  of  /?: 

P  =  (9/2)1/322/3.  (7) 

A  ->  oo 

Using  this  value  of  p  in  table  1,  we  find  that  the  ground  state  energy  E0  upto  the  second 
order  of  perturbation  is  given  by 

£0  =  35/48  (3/4)1/3  A1/3, 

E0  =  0.66  21/3.  (8) 

This  is  quite  close  to  the  value  of  E0  =  0.67  A113  as  given  by  Hioe  and  Montroll  (1975). 


3.  Concluding  remarks 

We  have  shown  how  the  linearization  technique  of  RPAcan  be  used  to  obtain  a  modified 
perturbation  expansion.  The  results  upto  the  second  order  in  perturbation  agree  very 
well  with  the  accurate  values  given  by  Hioe  and  Montroll  (1975).  The  formulation 
proposed  in  the  manuscript  is  fairly  general  and  can  be  applied  to  other  physical 
problems.  Thus,  though  it  may  not  always  be  possible  to  linearize  the  perturbation 
operator  it  may  be  modified  by  introducing  suitable  expectation  values  to  obtain  a 
modified  series. 
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Abstract.  We  propose  an  analytic  perturbative  approach  for  the  determination  of  the 
Feigenbaum-Cvitanovid  function  and  the  universal  parameter  a  occurring  in  the  Feigenbaum 
scenario  of  period  doubling  for  approach  to  chaotic  behaviour.  We  apply  the  method  to  the 
case  Z  =  2  where  Z  is  the  order  of  the  unique  local  maximum  of  the  nonlinear  map.  Our  third 
order  approximation  gives  a  =  2-5000  as  compared  to  “exact”  numerical  value 
a  =  2-5029  ....  We  also  obtain  a  reasonably  accurate  value  of  the  Feigenbaum-Cvitanovi<3 
function. 
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1.  Introduction 


The  discovery  by  M  Feigenbaum  of  universal  quantitative  behaviour,  in  the  maps  of  an 
unit  interval  onto  itself,  has  generated  enormous  excitement  as  it  opens  up  the 
possibility  of  quantitatively  understanding  turbulence  and  other  chaotic  natural 
phenomenon.  In  Feigenbaum  scenario  a  system  follows  period  doubling  route  to  chaos 
(Feigenbaum  1978,  1979,  1980). 

In  this  theory  asymptotically,  at  each  period  doubling  the  separation  between  any 
two  corresponding  adjacent  elements  of  a  periodic  attractor  is  scaled  by  a  constant 
ratio  a.  This  leads  to  the  existence  of  a  function  g(x )  which  reproduces  itself  under  the 
mapping  except  for  the  relevant  scaling.  This  function  satisfies  the  equation, 

-  a  (*/“))  =  g(x),  (1) 

and  is  normalised  by 

0(0)  =1.  (2) 


We  shall  refer  to  this  equation  and  g(x)  as  Feigenbaum-Cvitanovic  equation  and 
function  respectively  (Feigenbaum  1978).  For  a  generalisation  to  period  n-tuplings  of 
this  equation  we  refer  to  Cvitanovic-Myrheim  (1983).  Equation  (1)  together  with 
normalisation  (2)  determines  both  the  functional  form  of  g(x)  and  a.  We  shall  be 
interested  in  the  solution  which  has  a  local  maximum  at  x  =  0  of  the  order  Z.  Such  a 
solution  is  unique  (Feigenbaum  1980,  p.  15;  Collet  et  al  1980). 

The  universal  function  g(x)  and  a  do,  of  course,  depend  on  the  order  of  local 
maximum  i.e.  if  the  transformation  T  considered  is 


T:x  -*■  x'  =  A(1 


a 


n 
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then  they  depend  on  Z.  For  Z  =  2,  which  is  the  most  interesting  case,  we  have 
a  =  2-502907875  ... 

The  function  g(x)  has  also  been  determined  numerically  for  this  case  (Feigenbaum 
1979). 

Another  important  universal  constant  S ,  which  determines  the  rate  of  period- 
doublings,  is  given  by 

S  =  4-6692016  .  .  . 

for  Z  =  2.  A  consideration  of  asymptotic  period-doubling  leads  to  the  equation 

-oc[h(g(x/oL))  +  g'{g(x/ct))h(x/cc)]  =  Sh(x) 

S  >  1  (3) 

for  a  determination  of  the  universal  function  h(x)  and  universal  constant  S.  Again 
numerical  determination  of  h(x)  and  <5  has  been  carried  out  (Feigenbaum  1979). 

In  view  of  the  importance  of  these  functions  it  would  be  desirable  to  have  an 
analytical  approach  for  their  determination  as  opposed  to  one  involving  pure 
numerical  computation.  We  propose  an  analytical  perturbative  scheme  towards  this 
purpose  in  the  present  paper.  It  is  a  systematic  procedure  and  can  be  improved 
successively.  Among  other  attempts  to  determine  a,  but  not  the  g(x),  by  analytic 
approximations,  we  may  mention  those  of  Helleman  (1983)  and  Hu  and  Mao  (1982) 
among  others. 

2.  Basic  equations  and  method 

Replacing  x  by  ax  we  get 


g(ax)  +  ag[g(x)]  =  0, 

(4) 

3(0)=  1. 

(5) 

g(x)  =  1  +  p(x), 

(6) 

00 

P(X>  =  Z 

n  —  1 

(7) 

y  =  x  z  Pi  +  o, 

(8) 

where  Z  is  the  order  of  local  maximum  of  the  transformation. 

Further  let 

00 

p(ax)=  Y,  C„l>  (*)]"•  (9) 

n=  1 

Using  (6)  and  (9)  in  (4)  we  obtain 

00 

1  +  Z  cn[p(x)]n  +  ag[l  +p(x)]  =  0.  (10) 

n  =  1 

Equating  the  coefficients  of  [p(x)j"  to  zero  in  (10)  we  obtain 

1  +a0(l)  =  0, 


(11) 
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and 

n\C„  +  a/(n)(l)  =  0  (n=  1,2,...).  (12) 

On  the  other  hand,  if  we  expand  both  sides  of  (9)  in  powers  of  y  and  equate  the 
coefficients  we  obtain  (n  =  1,  2,  .  .  .) 


PJ"  =  I  ClPm,Pm2  ■  ■  ■  Pm,&mt+mi+  ...  +m,. 

where  ft  —  |a|z. 

We  display  first  few  of  these  equations. 


PlP  =  C,?!, 


PlP2  =  C1P2+C2P1 . 

P3  P2  =  Cip3+C2(2plp2)  +  C3pl, 

P*P 4  =  Clp4  +  C2(2p1p3  +  p2)  +  C3(3pfp2)  +  C4pt, 


(13) 


(14) 

(15) 

(16) 
(17) 


We  note  that  since  px  ±  0, 


Q  = 


a 


It  is  also  useful  to  note  that 


/('»d)  = 


(rZ)\ 

l\(rZ-l)\  Pr 


for  /  =  1,  2,  3,  ...  ,  and 

00 

/(»=  1+  I  Pr- 

r=  1 

If  we  combine  (11),  (13),  (19)  and  (20),  and  further  define 

P„a"  =  S„|a|z 
n  =  1,  2,  .  .  . 


We  obtain  the  basic  equations  to  be  solved 


I+i  + 

a 


a 


I  §  =  ° 


,  CL 


and 


sn  +  X 


(rZ)!  Sr 


Sm  Sm 

m  1  w2 


m. 


U 


l\(rZ  —  /)!  a' 


a|z("-/) 


=  0 


for  n  >  1. 

The  summation  in  (23)  is  over  the  set  U  given  by 


m1  ^  1,  m2  ^  1,  .  .  .  ,  ^  1 

and 

mj  +m2  +  ...  +mt  =  n. 


(18) 

(19) 

(20) 

(21) 

(22) 

(23) 


P—  i'3f 
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We  now  note  that  (23)  for  n  —  1  leads  to 


and  since  S1  ±  0  we  get 


(24) 


In  fact  using  this  equation  we  can  simplify  the  remaining  equations  (23)  for  n  ^  2  to 
read 


I 


..S  A 


X 


mi  >  1,  .  .  ■  ,  m,  >  l 


=  0 


a  z(»-0 


for  n  —  2,  3,  .  .  . 


(25) 


Equations  (22)  and  (24)  and  (25)  together  constitute  a  system  of  coupled  nonlinear 
equations  to  determine  a  and  5 j,52,  .  .  .  We  regard  (22)  as  the  eigenvalue  equation  for  a 
while  (24)  and  (25)  are  to  be  used  to  determine  Sls  S2, .  .  .  in  terms  of  a. 

These  equations  are,  however,  highly  nonlinear  and  coupled.  One  needs  a  systematic 
procedure  for  their  solution.  We  now  notice  that  (24)-(25)  allow  us  to  conclude  that,  as 
a  -+  oo,  we  have  a  solution  for  which  Sn  tends  to  constants  Sn  0  i.e. 


In  particular  S„’s  admit,  as  a  -►  oo,  an  expansion  in  the  inverse  powers  of  «,  of  the  form, 
for  Z  a  nonrational  number, 


SM  =  I 


p.  q  =  0,1,2,... 


where  Sn  p  q  are  numerical  constants.  When  Z  is  integral  or  a  rational  number  simpler 
expansions  can  be  written  down. 

We  propose  to  use  such  a  ->  oo  expansions  and  use  them  to  S„(a)  in  terms  of  a  by 
using  (24)  and  (25).  These  expansions  for  S„(a)  when  substituted  in  (22)  would  give  us 
the  eigenvalue  equation  for  the  determination  of  a. 

It  is  known  that  as  Z  -►  1,  a  does  tend  to  infinity.  We  thus  expect  this  procedure  to 
clearly  work  when  Z  -►  1  (Collet  et  al  1980;  Derrida  et  al  1978, 1979).  We  will  see  in  the 
next  section,  where  we  apply  it  to  the  physically  most  interesting  case  i.e.  Z  —  2  that  the 
method  is  still  quite  effective. 

3.  Application  of  the  method  for  Z  =  2 

We  shall  now  apply  our  method  for  the  Z  =  2  case.  In  this  case  we  can  use  the  simpler 
expansion  given  by 
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SM  =  I 

m-  0,  1 ,  2, 


’n,  m 


a 


m 


(26) 


Using  (24)  we  obtain,  as  a  ->  oo 

S,(a)--l/2.  (27) 

Similarly  from  (5)  and  using  (27)  we  obtain  the  following  leading  behaviours,  as  a  -»  oo 


S2"V  s3 


1 


1 


etc. 


(28) 


16a’  128a 

Proceeding  further  and  equating  powers  of  a  on  both  sides  of  (24)  and  (25)  we  obtain 


0 

=  Si, 

1  + 

2S2, 

0 

0 

=  S2> 

1  + 

3  Si. 

oSi.  i  +6(Slt 

o)2^2,  0> 

0 

=  S|, 

2  + 

2S2, 

i  +  353;  o, 

0 

=  s2. 

2  — 

S 2 ,  0 

+  (S1.o)2[Sli 

2  +  6S2  J  +  15S3)  o] 

+  2iSj 

1  > 

oSi. 

i  [Si,  i  4"  6 S2 

0]  4  [2Si,  0^1,  2  +  Sf 

0 

-Si. 

3  + 

252, 

2  4-  353  j  +  4S4,  o»  elc* 

(29) 


Combining  the  information  contained  in  (28)  with  that  contained  in  (29)  we  obtain 
Si,o=  -1/2; 

Si,!=  -1/4,  S2. o  =  l/8; 

^1,2  =  s2,  i  =  o  —  0; 

Si,  3  =  -  1/4,  S2, 2  =  1/32,  S3i !  =  1/16,  S4i  o  =  0;  etc.  (30) 

Using  these  coefficients  we  obtain  the  approximation 


/all  \  -  (l  1 

3(x>=1-(  2  +  4  +  4?+  •■•)*  +  (  8  +  32?  + 


x 


+ 


1 


16aJ 


+  .  .  :  +  . 


(31) 


where  all  the  coefficients  of  the  powers  of  x2  have  been  kept  to  the  order  (1/a2). 
The  eigenvalue  equation  is 

J{<x)  =  0, 

1  00  S 

where  J( a)  =  -  +  1  +  a  V  — /j  and  we  have 
a  „  =  i  a 


(32) 


a  7  1  5 

J(a)  =  --  +  -H - — — y  +  0(l/a  ). 

oo  2  8  a  32a 

To  this  order  in  approximation  we  obtain  from  (32)  using  (33) 
a  =  5/2  =  2-5000 


(33) 


(34) 


to  be  compared  with  the  result  of  the  “exact”  numerical  computation  a  =  2-5029  .... 
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Using  the  approximate  value  a  =  5/2  we  get  from  the  expression  (31), 

0(x)«  1  -l-5400x2  +  0-1300x4  +  0-0100x6  +  ....  (35) 

We  also  quote,  for  comparison,  the  result  of  the  “exact”  numerical  computation 
(Feigenbaum  1979) 

g(x)&  1 -l-5276x2  +  0-1048x4  +  0-0267x6-0-0035x8  +  .  .  .  (36) 

where  we  have  kept  only  the  four  figures  after  the  decimal  sign. 


4.  An  exact  solution 


An  exact  solution  of  (4)  is  given  by 


g(x)  =  (1  +|x|z)1/z, 

1 


a  = 


2vz- 


(37) 

(38) 


This  solution  is,  however,  of  no  interest  for  the  Feigenbaum  scenario  for  the  onset  of 
the  chaotic  behaviour  since  g(x )  has  a  local  minimum  rather  than  local  maximum  at  x 
=  0.  It  however  implies  that  the  eigenvalue  equation  for  a  should  have  a  root  given  by 
(38).  For  Z  =  2  we  should  thus  have  a  root  at  a  =  —  1  /y/2  «  —  0707  ...  for  the  exact 
eigenvalue  equation.  Our  approximate  eigenvalue  equation  given  by  (32)  and  (33)  i.e. 


a  7  1  5 

2  +  8  +  a  32a2 


has  a  negative  root  at  a  =  —  (y/l7  +  3)/8  %  —  0-8904  which  presumably  a  reflection  of 
the  root  a  0-707  ...  for  the  exact  equation. 


5.  Concluding  remarks 

We  thus  see  that  the  present  method  produces  reasonably  accurate  results.  Since  the 
method  is  a  systematic  one  the  accuracy  can  be  improved  by  taking  higher  order 
approximations. 

The  real  utility  of  the  method  would  be  to  further  allow  us  to  calculate  the 
Z-dependance  of  the  universal  parameter  a.  Moreover  whenever  one  needs  g(x)  as  an 
input,  such  as  for  example  in  the  equations  for  h(x),  it  is  useful  to  have  such  analytic 
approximations  to  g(x)  as  we  have  obtained  here.  These  and  similar  investigations  will 
be  reported  later. 
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Quantum  chaos  and  fluctuation  properties  of  regular  and  irregular 
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Abstract.  Fluctuation  properties  of  the  regular  and  irregular  energy  levels  for  the  Henon- 
Heiles  Hamiltonian  are  examined.  The  spacing  distributions  and  the  calculated  values  of  the 
A3-statistic  show  that  there  is  no  difference  in  the  short  range  correlation  properties  of  these 
spectra.  Remarkably,  the  A3  values  agree  with  the  results  of  random  matrix  theory. 

Keywords.  Quantum  chaos;  fluctuation  properties;  regular  spectra;  irregular  spectra; 
spectral  signature;  spacing  distribution. 

PACS  No.  05*40;  24*60;  46*60 


1.  Introduction 

During  the  last  few  decades  much  progress  has  been  made  in  understanding  the  (time) 
evolution  of  classical  dynamical  systems  in  phase  space.  In  particular  the  studies  have 
provided  a  classification  scheme  for  dynamical  systems  based  on  how  “regular”  or 
“erratic”  the  motion  is  in  phase  space.  As  a  result  with  increasing  “irregularity”  of 
motion,  the  systems  are  in  turn  labelled  periodic,  quasi-periodic,  ergodic,  mixing  and 
chaotic  (K-systems).  An  account  of  the  earlier  work  on  dynamical  systems  has  been 
given  by  Khinchin  (1949)  and  the  more  recent  formal  developments  are  described  by 
Arnold  and  Avez  (1968),  Ornstein  (1974),  Arnold  (1978)  and  Krylov  (1978).  Ford  (1973) 
reviewed  these  developments  in  a  more  physical  and  intuitive  way. 

In  this  paper  our  interest  is  in  “regular”  and  chaotic  motion.  It  is  therefore  worth 
recalling  that,  for  regular  (periodic  and  quasi-periodic)  motion  the  system  remains 
confined  to  a  restricted  region  (in  general  invariant  tori)  of  the  total  available  phase 
space.  Also  recall  that  for  a  chaotic  system  the  motion  is  over  the  entire  available  phase 
space;  it  is  extremely  “erratic”  and  it  shows  random  behaviour.  This  occurs  in  spite  of 
the  fact  that  the  system  is  fully  (all  degrees  of  freedom)  described  and  the  equations  of 
motion  governing  its  evolution  are  completely  deterministic.  Although  many  aspects  of 
“regular”  and  chaotic  motion  in  classical  systems  are  fairly  well  understood,  the  way  in 
which  they  are  expressed  in  the  corresponding  quantum  systems  are  not  always  equally 
clear.  (For  reviews  on  “quantum  chaos”  see  Zaslavsky  1981  and  Bohigas  and  Giannoni 
1984). 

The  purpose  of  this  paper  is  to  discuss  one  aspect  of  the  “quantum  chaos”  problem 
that  has  been  extensively  studied  in  recent  years.  More  precisely,  we  describe  how 
“regular”  and  chaotic  motion  in  a  bound  classical  Hamiltonian  system  will  be  reflected 
in  the  discrete  energy  spectrum  of  the  corresponding  quantum  system.  The  suggestions 
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in  this  respect  have  included  (i)  examining  properties  of  individual  energy  levels 
(Percival  1973)  and,  (ii)  examining  statistical  (fluctuation)  properties  (Zaslavsky  et  al 
1974;  Zaslavsky  1977)  of  a  whole  set  of  levels. 

In  view  of  these  suggestions,  we  summarize  in  §  2  results  of  earlier  studies  on  spectral 
signatures  which  characterize  quantum  “regular”  levels  and  quantum  “irregular” 
(chaotic)  levels.  We  also  point  out  (what  in  our  view  is)  an  omission  in  all  studies 
involving  fluctuation  properties.  It  turns  out  that  this  has  a  bearing  on  signatures  which 
are  a  consequence  of  statistical  properties  of  energy  levels. 

Section  3  contains  a  brief  description  of  our  calculations  and  results  for  the  Henon- 
Heiles  (Henon  and  Heiles  1964)  model  Hamiltonian.  A  summary  and  concluding 
remarks  are  presented  in  §4. 


2.  Spectral  signatures  of  regular  and  irregular  levels 

Following  Percival’s  work  (Percival  1973)  there  are  several  studies  proposing 
signatures  in  the  energy  spectrum  that  will  distinguish  between  the  “regular”  levels 
(classically  quasi-periodic  motion  in  phase  space)  and  the  “irregular”  levels  (classically 
chaotic  motion).  The  basic  conclusion  is  that  “irregular”  levels  correspond  to  large 
second  differences  (Pomphrey  1974)  or  “avoided”  crossings  (Noid  et  al  1980)  in  energy. 
This  is  not  the  case  for  “regular”  levels.  Further,  it  appears  (Pullen  and  Edmonds  1981) 
that  large  second  differences  correspond  to  “avoided”  crossings. 

Another  suggestion  (Zaslavsky  et  al  1974;  Zaslavsky  1977)  has  been  to  study  the 
distribution  of  energy  spacings  between  neighbouring  levels.  It  has  been  shown  that,  for 
the  “regular”  spectrum,  this  distribution  (Berry  and  Tabor  1977)  is  of  the  Poisson  form, 
and  for  the  “irregular”  spectrum  (McDonald  et  al  1979;  Casati  et  al  1980;  Pechukas 
1983;  Bohigas  et  al  1984;  Haller  et  al  1984;  Seligman  et  al  1984)  it  is  essentially  of  the 
Wigner  form  due  to  repulsion  between  neighbouring  levels  belonging  to  the  same 
(exact)  symmetry. 

It  should  be  stressed  however  that  for  any  interacting  system  neighbouring  levels 
belonging  to  the  same  exact  symmetry  will  repel  each  other.  Since  for  “irregular”  levels 
there  are  no  other  quantum  numbers  (constants  of  motion)  except  energy,  level 
repulsion  is  to  be  expected  in  their  spacing  distribution.  For  “regular”  motion  if  the 
system  (having/ degrees  of  freedom)  is  integrable  there  are  (/—  1)  other  constants  of 
motion  besides  energy.  Even  if  the  system  is  not  integrable,  for  regular  motion  it  follows 
from  Kolmogorov-Arnold-Moser  (kam)  theorem  (see  e.g.  Arnold  and  Avez  1968)  that 
the  phase  space  trajectory  will  be  on  a  deformed  torus.  This  suggests  that  in  this  case 
also  there  ought  to  be  in  addition  to  energy  (/—  1)  other  constants  of  motion.  It  is  not 
clear  at  all  how  to  determine  these  additional  constants  of  motion  for  a  non-integrable 
system.  All  the  same  it  is  crucial  that  they  (actually  the  corresponding  quantum 
numbers)  be  considered  when  obtaining  the  fluctuation  properties  of  “regular”  levels.  If 
this  is  not  done,  the  resulting  properties  would  very  likely  be  a  consequence  of  a 
superposition  of  uncorrelated  level  sequences.  We  believe  that  the  Poisson  spacing 
distribution  for  the  “regular”  levels  is  an  artefact  arising  from  just  such  a  superposition 
of  uncorrelated  energy  levels.  This  is  because  no  other  quantum  numbers  except  energy 
are  taken  account  of  in  the  earlier  studies. 
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3.  Fluctuation  properties  of  Henon-Heiles  Hamiltonian 

Keeping  in  view  these  considerations,  we  determine  in  this  section  the  fluctuation 
properties  of  the  energy  eigenvalues  of  the  modified  Henon-Heiles  (Henon  and  Heiles 
1964;  Pullen  and  Edmonds  1981)  Hamiltonian.  The  reason  for  choosing  this 
Hamiltonian  is  that  it  has  only  two  degrees  of  freedom,  but  more  important  has  been 
the  fact  that  in  our  knowledge  it  is  the  Hamiltonian  which  has  been  most  extensively 
studied  in  connection  with  classical  and  quantum  chaotic  behaviour. 

For  this  Hamiltonian,  we  have  obtained  the  nearest  neighbour  spacing  distribution 
and  the  A3-statistic  of  Dyson  and  Mehta  (1963).  For  evaluating  these  properties  we 
have  very  closely  followed  the  approach  of  Haq  et  al  ( 1 982).  We  would  like  to  determine 
(i)  if  there  is  a  difference  between  the  fluctuation  properties  of  the  “regular”  and  the 
“irregular”  spectrum,  and  (ii)  how  these  properties  compare  with  the  prediction  of 
random  matrix  theory. 

As  we  shall  see,  we  are  able  to  provide  unambiguous  answers  using  both  the  spacing 
distribution  and  the  A3-statistic. 

We  briefly  describe  our  calculations  next.  The  model  Hamiltonian  (Pullen  and 
Edmonds  1981)  is 

H  =i  (Px+Py  +x2  +  y2)  +  a(x2y-l/3y3),  (1) 

where  a  is  the  strength  parameter.  Note  that  the  anharmonic  term  has  a  3-fold 
symmetry  ( C3v  point  group)  (Pullen  and  Edmonds  1981).  We  have  separately  calculated 
the  eigenvalues  for  each  of  the  3-irreducible  representations  (labelled  At,  A2,  E)  for 
a  =  0  088.  This  particular  value  was  used  by  Pomphrey  (1974)  and  Pullen  and 
Edmonds  (1981).  The  diagonalization  procedure  used  (reducing  to  tridiagonal  form 
followed  by  bisection)  was  the  same  that  Pullen  and  Edmonds  followed.  The  number  of 
eigenvalues  we  obtain  for  each  symmetry  below  the  escape  energy  (l/6a2  =  21  *52)  is 
shown  in  table  1 .  The  convergence  of  the  eigenvalues  was  good  to  four  significant  places 
when  enlarging  the  basis  space.  For  each  symmetry  we  identified  the  “regular”  and  the 
“irregular”  levels  by  considering  table  2  of  Pullen  and  Edmonds.  It  turns  out  then  that 
most  of  the  levels  below  E  -  Ec  ( Ec  —  14*5—1 7*5)  belong  to  the  “regular”  spectrum  and 
those  with  E  >  Ec  belong  to  the  “irregular”  spectrum.  Ec  is  different  for  the  three 
symmetries.  We  then  find  that  out  a  total  of  1 71  levels,  87  belong  to  the  regular  part.  The 
remaining  84  levels  characterize  irregular  levels. 

The  fluctuations  (Porter  1965;  Mehta  1967;  Brody  et  al  1981)  in  levels  are  (by 
definition)  departures  from  a  smooth  uniform  spectrum.  Since  for  our  model  H  the 


Table  1.  Number  of  eigenvalues  for  a 
=  0  088  for  each  symmetry  below  the  escape 
energy  (=  21-52). 


Symmetry  type 

Number  of  eigenvalues 

Al 

50 

A  2 

37 

E 

84 

Total 

171 
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level  density  is  not  a  constant,  it  is  essential  to  “unfold”  (Brody  et  al  1981)  the  spectrum 
and  map  it  to  one  with  constant  density.  The  mapping  we  have  chosen  is 

F  (E)  —  a  +  bE  +  cE2.  (2) 


This  is  a  natural  choice  because  the  quadratic  term  is  the  distribution  function 
(continuous  approximation)  for  the  unperturbed  oscillator.  The  parameters  a ,  b,  c  were 
determined  separately  for  each  symmetry  by  a  best  fit  to  the  calculated  energy  levels. 

We  very  briefly  discuss  next  the  basic  definition  and  physical  meaning  of  the  spacing 
distribution  p  (x)  and  the  A3-statistic.  p  (x)  dx  is  the  probability  for  the  spacing  between 
two  levels,  with  no  levels  lying  between  them,  to  take  a  value  between  x  and  dx.  Note 
that  x  is  measured  in  units  of  constant  mean  spacing  and  J  ®  p  (x)  dx  =  1.  As  described 
in  detail  elsewhere  (Bohigas  and  Giannoni  1975)  the  expressions  for  A3  can  be  reduced 
(for  purposes  of  calculations)  to 


A3  (n) 


N2  1 


+  ~  t  (N-2i+\)E, 
n  it i 


In  (3)  n  denotes  a  fixed  energy  interval  in  units  of  (constant)  mean  spacing  and  N  is 
the  actual  number  of  energy  levels  in  this  interval.  The  N  levels  have  energies 
E(  (i  =  1, .  .  .  N).  Physically,  A 3(n)  provides  a  measure  of  departure  of  the  exact 
eigenvalue  distribution  function  (staircase  curve)  from  a  straight  line.  It  should  be 
recalled  that  (Dyson  and  Mehta  1963)  for  the  gaussian  orthogonal  ensemble  (goe)  of 
real  symmetric  matrices,  the  mean  A 3(n)  over  the  ensemble  is  known  for  n  ->  oo.  For 
finite  values  of  n  it  has  been  obtained  by  Bohigas  et  al  (1983)  using  Monte-Carlo 
calculations. 

The  results  of  our  calculations  for  p  (x)  and  A3  are  discussed  next. 


3.1  Spacing  distributions 

When  a  =  0,  the  system  (equation  (1))  is  integrable  and  one  has  only  a  “regular” 
spectrum.  The  energy  levels  (two-dimensional  harmonic  oscillator)  are  labelled  by  the 
total  number  of  quanta  n  and  “angular  momentum”  /,  where  n  —  0, 1,  2, ...  ,  and  for  a 
fixed  n,  l  =  —  n,  —  n  +  2, .  . .  n  —  2,  n.  Note  that  all  levels  for  a  fixed  n  are  degenerate. 
Clearly,  the  spacing  distribution  of  levels  with  a  fixed  /  is  a  <5-function,  which  shows 
extreme  level  repulsion  and  rigidity.  One  would  obtain  a  Poisson  distribution  if  one 
superimposes  many  uncorrelated  /-sequences. 

When  a^O,  the  system  is  not  integrable  and  classically  one  has  either  regular  motion 
(kam  regime)  or  chaotic  motion  in  phase  space.  In  the  former  case,  the  motion  in  phase 
space  is  confined  to  a  two-dimensional  surface  (deformed  torus).  Hence,  there  ought  to 
be  another  “local”  constant  of  motion  (quantum  number)  besides  energy  for  the 
“regular”  levels.  This  quantum  number  should  be  included  in  the  analysis  of  fluctuation 
properties  of  the  “regular”  levels.  If  this  is  done,  then  since  the  system  (in  C3v  oscillator 
basis)  is  an  interacting  system,  one  would  expect  level  repulsion  for  both  “regular”  and 
“irregular”  levels.  Our  results  (without  the  additional  quantum  number  for  “regular” 
levels)  are  shown  in  figure  1.  They  show  level  repulsion  for  both  “regular”  and 
“irregular”  levels.  These  results  are  different  from  those  of  Haller  et  al  (1984)  and 
Seligman  et  al  (1984)  for  the  “regular”  levels.  One  possibility  for  the  difference  is  that 
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Figure  1.  Spacing  distributions  for  regular  levels  (figure  la)  and  irregular  levels  (figure  lb). 
Histograms  denote  the  calculated  distributions  for  the  Henon-Heiles  potential.  Smooth  curves 


n 


denote  the  theoretical  (Wigner)  distribution  p(x )  =  —  x  exp 


compared  to  our  case,  they  may  have  a  much  larger  number  of  uncorrelated  sequences 
in  their  calculations. 


3.2  A  ^-statistic 

We  have  evaluated  A3  (h)  for  0  <  n  ^  10  by  separately  taking  the  spectral  average  over 
the  “regular”  and  the  “irregular”  levels  belonging  to  each  symmetry,  and  then  taking  the 
mean  (“ensemble”  average)  over  the  three  symmetry  subspaces.  The  results  are  shown 
in  table  2. 

The  result  of  A3  («)  for  n  ^  5  clearly  shows  no  discernible  difference  between  the 
values  for  the  “regular”  and  “irregular”  spectrum.  For  values  of  n  >  5  the  difference  is 
«  10%  but  then  the  statistics  are  not  as  good  as  those  for  n  <  5.  The  agreement  with 
the  theoretical  prediction  of  goe  is  also  very  good  for  small  n  and  deteriorates  to 
15-20  %  difference  for  larger  n.  There  is  however  no  agreement  between  the  calculated 
A3  (n)  and  the  Poisson  distribution  values. 

In  view  of  the  numerical  results,  for  A3,  it  seems  reasonable  to  conclude  that 
whenever  we  have  been  able  to  carry  out  sharpest  (best  statistics)  comparisons,  there  is 
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Table  2.  Values  of A3(n)  for  regular,  irregular,  goe  (from  Bohigas  et  al 
1983)  and  Poisson  spectra  (A3(n)  =  n/15). 


n 

a3(«) 

regular 

irregular 

GOE 

Poisson 

0-25 

00169 

0-0169 

0-0165 

0-0167 

0-50 

00326 

0-0334 

00325 

0-0333 

0-75 

0-0475 

0-0489 

0-0471 

00500 

10 

0-061 1 

0-0630 

00605 

00667 

20 

0-1127 

0-1147 

0-1023 

0-1333 

3-0 

0-1535 

0-1506 

0-1320 

0-2000 

40 

0-1807 

0-1770 

0-1549 

0-2667 

50 

01927 

0-1953 

0-1735 

0-3333 

60 

0-1977 

0-2100 

0-1893 

0-4000 

7-0 

0-2025 

0-2204 

0-2028 

0-4667 

8-0 

0-2077 

0-2267 

0-2148 

0-5333 

90 

0-2120 

0-2340 

0-2255 

0-6000 

10-0 

0-2140 

0-2379 

0-2356 

0-6667 

no  difference  between  the  fluctuation  properties  of  regular,  irregular,  and  goe  spectra. 
Thus  such  fluctuation  measures  (statistic)  are  not  likely  to  provide  a  signature  for 
distinguishing  between  the  “regular”  and  the  “irregular”  spectra.  It  is  also  remarkable 
that  a  system  with  just  two  degrees  of  freedom  leads  to  fluctuation  properties  that  are 
identical  to  goe — essentially  a  parameter-free  theory  originally  proposed  for  a  system 
with  a  larger  number  of  degrees  of  freedom. 


4.  Summary  and  concluding  remarks 

Our  results  show  no  significant  difference  in  the  fluctuation  properties  of  regular, 
irregular  and  goe  spectra.  As  stated  before,  it  is  not  clear  why  our  results  for  “regular” 
spectra  do  not  agree  with  those  of  Haller  et  al  (1984)  and  Seligman  et  al  (1984).  While 
our  statistics  are  small  compared  to  theirs,  it  seems  unlikely  that  our  conclusions  would 
be  altered  in  a  significant  manner  with  improved  statistics.  For  this  purpose  further 
calculations  are  currently  in  progress.  The  agreement  between  the  results  for  “irregular” 
spectra  and  goe  is  consistent  with  earlier  results  and  seems  to  suggest  (Bohigas  et  al 
1984)  universality  in  the  behaviour  of  fluctuation  properties — independent  of  the 
number  of  degrees  of  freedom. 
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Self-energy  effect  in  a  relativistic  bound  state  problem 
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Department  of  Physics,  University  of  Delhi,  Delhi  110007,  India 

Abstract.  A  modification  of  the  Wick-Cutkosky  equation  for  the  relativistic  bound  state  of 
two  scalar  particles  interacting  through  the  exchange  of  a  massless  scalar  field  within  the  ladder 
approximation  has  been  considered  by  incorporating  the  self-energy  diagrams  in  the  integral 
kernel.  An  exact  analytical  solution  of  the  equation  is  obtained  at  vanishing  total  energy  and  it 
is  shown  that  the  self-energy  effects  generally  diminish  the  eigenvalues  in  agreement  with  the 
findings  of  Li  et  al,  who,  however  solved  the  equation  numerically  for  the  case  of  massive  scalar 
exchange.  An  additional  feature  of  the  modified  equation  is  that  it  preserves  the  0(5)  symmetry 
at  zero  total  energy  as  was  first  noted  by  Cutkosky  for  the  scalar  bound  state  equation  without 
self-energy  effects. 

Keywords.  Bethe-Salpeter  equation;  relativistic  bound  state  problem;  self-energy  effects. 

PACSNo.  1110 


1.  Introduction 

The  covariant  wave  equation  originally  formulated  by  Bethe  and  Salpeter  (1951)  more 
than  thirty  years  ago  still  remains  the  main  tool  to  study  the  bound  state  problem  in  the 
context  of  quantum  field  theory.  In  momentum  space  it  is  given  by  an  integral  equation 
with  integral  kernel  determined  by  four-point  irreducible  Feynman  diagrams  as  well  as 
by  self-energy  diagrams.  In  the  absence  of  a  complete  knowledge  of  all  these,  the  ladder 
approximation  is  generally  used.  In  this  context  the  studies  of  Wick  (1954)  and 
Cutkosky  (1954)  for  a  scalar  Bethe-Salpeter  equation  (bs  equation)  have  revealed  many 
interesting  consequences  of  a  relativistic  bound  state  problem.  Later  Blanckenbecler  and 
Sugar  ( 1 966)  have  pointed  out  the  effect  of  unitarity  condition  on  the  bs  equation  and  in 
particular  Levine  and  Wright  (1967)  have  shown  that  the  unitarity  relation  namely 
<7(0tai  ^  ^inelastic  can  violated  i n  the  inhomogeneous  bs  equation  describing  scattering 
processes  if  only  the  ladder  diagrams  are  taken  into  the  integral  kernel.  Inclusion  of  self¬ 
energy  diagrams,  do,  however,  remove  this  difficulty.  Recently  Li  et  al  (1980)  have 
considered  the  influence  of  the  self-energy  diagrams  on  the  solutions  of  the  bound  state 
equation  involving  scalar  particles.  Briefly,  they  considered  a  scalar  bs  equation  with  the 
interaction  chosen  as  =  g(p*  (x)</>!  (x)</>2(x),  where  t/q  is  a  complex  scalar  field 
with  mass  m  and  (j)2  is  a  neutral  scalar  field  with  mass  p.  In  the  second-order  the  integral 
kernel  of  the  equation  has  the  form 

~2  1 - ,l2  ,  2  ~  ^*(P)(Pi  +  m2)(Pi  +  m2)J4(P  -  P')>  (1) 

n2  (p-p)2  +  p2 

where  h(p)  stands  for  the  self-energy  contribution  to  the  kernel  and  px  and  p2  are 
respectively  p  +  2P  and  —  P  +  iF.  P ^  is  the  centre-of-mass  four-momentum  of  the 
composite  system.  It  is  well  known  that  for  nonvanishing  p  the  scalar  bs  equation,  even 
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with  the  first  term  of  the  kernel  given  by  (1),  cannot  be  solved  analytically.  Li  et  al  (1980) 
adopted  a  comprehensive  numerical  study  of  the  problem  with  the  integral  kernel  given 
in  (1).  Their  calculations  show  that  self-energy  diagrams  in  general  diminish  the 
eigenvalues  and  also  alter  the  wave  functions  slightly. 

In  this  note  we  point  out  that  with  a  little  modification  of  the  kernel,  exact  analytic 
solution  of  the  problem  exists  for  some  given  values  of  total  energy  and  that  the 
properties  of  the  eigenvalues  with  respect  to  the  coupling  parameter  can  be  easily 
determined.  Li  et  al  (1980)  have  discussed  in  detail  through  their  numerical  analysis  the 
eigenvalues  of  the  coupling  parameter  for  various  non-vanishing  values  of  p  and  for  a 
given  set  of  values  of  the  energy  parameter  rj  —  E/2m  where  E  is  the  fourth  component 
of  PM.  In  order  to  obtain  analytic  solutions  we  modify  the  kernel  as 

%  7 — ^72  -  Mp)(Pi  +  m2)(pl  +  m2)S4(p  —  p').  (2) 

n  (P~P) 

We  thus  consider  the  situation  when  only  massless  scalar  particles  are  exchanged, 
however,  the  mass  of  the  neutral  scalar  contributing  to  the  self-energy  expression  h{p) 
need  not  be  zero  and  can  be  taken  at  will.  The  equation  we  thus  consider  is  a  simple 
generalization  of  the  original  Wick  (1954)-Cutkosky  (1954)  equation  together  with  an 
additional  term  in  the  kernel  due  to  self-energy  contribution  to  second  order  in 
coupling  strength  g{2  «  g2). 


2.  Separability  of  the  modified  Wick’s  equation 


The  modified  Wick’s  equation  which  we  like  to  solve  reads 


(p2i+m2)(pl  +  m2)x(P)=  ~  ^2  P(pL  +  ^(p)(Pi  +  m2)(pj  4-  m2)X(p),  (3) 

when  both  momentum  p  and  p'  are  in  Euclidean  space,  and  the  self-energy  function  h(p) 
is  given  by 


h(p)  = 


r°°  (p2+m2)[cr2-(m  +  p)2]1/2[<72-(m-/i)2]1/2  __  _2 


(m  +  f*)2  a2(a2 -m2)2(pi  +  a2) 

+  (Pl  ->P2) 


dcrJ 


We  rewrite  (3)  as 
*A(p)  = 


71 


d  4PV(P') 


(P  - P')2(Pi2  +  ">2)(P22  +  m2)[l  -  2/i(p')] 


(4) 


(5) 


where  p\  and  p'2  are  respectively  equal  to  p'  +  {E/2)  and  —  p'  +  {E/2)  and  \l/{p)  =  [1 
-  A/i(p)]  {p\  +  m2)  {p\  +  m2)x{p).  Hereafter,  we  will  assume  that  the  3-vector  part  of  P M 
is  a  null-vector  for  simplicity.  Equation  (5)  can  also  be  written  as  a  differential  equation 
i.e. 


B\p{p)  -  -4A 


<A(p) 

{P2i  +m2){pl  +  m2) 


1 

1  —  2h{p)  ’ 


and 


□  =  {d2/dpl)+{d2/d  p2), 


(7) 
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We  now  show  that  (6)  is  separable  in  terms  of  new  (Green  1957;  Biswas  1967)  variables 
defined  by 


px  =  ps  sin  6  cos  </>, 

p2  =  ps  sin  6  sin  </>,  (8) 

p3  =  cos  0, 


and  further,  ps  —  C  sin  t]/ (cosh  a  —  cos  rj)  and  Px  —  C  sinh  a/(cosh  a  —  cos  f/). 

The  whole  energy-momentum  plane  is  contained  in  the  region  0  ^  0  ^  7c,  0  ^  ^  2rr 

and  —  go  ^  a  ^  oo  and  the  parameter  C  is  given  by 

C2  =  (E/2)2  +  m2  (9) 

In  terms  of  these  new  variables,  the  self-energy  function  h(p)  becomes  a  function  of  the 
variable  a  only.  For  instance,  introducing  a  quantity  c'  with 

C'2  =  {E/2)2  -1-  o2.  (10) 

The  function  h(p)  can  be  written  as, 


h{p)  =  2 


ao 


l(m  +  n)2 


[cr2  —  (m -f  /i)2]1/2[<t2  —  {m  —  p)2y12  C 

a 


2\2 


o2{o2-m z) 


CC  —  E2 / 4  tanh2a 
C,2-E2/ 4  tanh2a 


=  A  (m,  p,  tanh  a) 


(11) 


In  the  new  variables  defined  by  (8)  separable  solutions  are  effected  by  writing  \p(p)  as 


<p ip)  =J(n)g()i)Ylm{t),  <p)/ps.  (13) 

Substituting  (13)  in  (6)  we  obtain  after  separation  of  variables  the  following  two 
equation  for  g{t])  and  /(a)  namely, 


and 


d V  J  2  /(/+!) 

d^2  [  sin2  rj 


(14) 


d 2f 

da2 


A 

E2 

C2  cosh2  a  — —  sinh2  a 


1 

1  —  AA(m,  p,  tanh  a) 


/=  0, 


(15) 


where  n  is  a  separation  constant. 

We  thus  see  that  the  modified  Wick-Cutkosky  equation  which  includes  the  effect  of 
self-energy  insertions  in  the  integral  kernel  of  the  bs  equation  still  admits  separable 
solution.  In  §3  we  discuss  the  eigen  solution  of  the  problem  and  concentrate  on  the 
particular  case  E  =  0  which  affords  a  simple  exact  solution  of  (15)  determining  the 
eigenvalues. 


3.  Solution  at  E  =  0 

At  E  =  0  we  note  from  (11)  that  A(m,  p,  tanh  a)  assumes  a  particularly  simple  form: 
Here  A (m,  p ,  tanh  a)  becomes  independent  of  a  and  A  (m,  p,  tanh  a)  reduces  to  a  constant 


P 


4 
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depending  on  m  and  g  only  which  we  denote  by  A,  i.e. 
A (m,  g,  tanh  a)£  =  0  = 


(16) 


and  (15)  then  reduces  to 


d2/(a) 

da2 


A/(l-M) 
m2  cosh2  a 


►/(a)  =  0. 


(17) 


The  boundary  conditions  on  g(rj)  and  /(a)  in  the  new  coordinates  are  obtained  by 
substituting  them  in  the  original  integral  (9)  which  is  satisfied  if  #(0)  =  g(n)  and  /(a) 
vanishes  at  large  a  and  is  an  even  function  of  a.  Further  /  and  g  must  be  finite  within  the 
given  ranges  as  mentioned  at  the  end  of  (8). 

The  acceptable  solutions  of  (14)  consistent  with  the  above  mentioned  boundary 
conditions  are  the  well-known  Gegenbauer  polynomials, 


g{rj)  —  constant  x  sin 1  +  1  (rj)  •  Cj,L}_  x  (cos  g). 


(18) 


Equation  (17)  determines  the  energy  eigenvalues  of  the  bound  states.  Here,  for  a  given 
E ,  the  equation,  however,  determines  the  spectrum  of  values  of  X.  If  we  transform  (1 7)  in 
terms  of  a  new  variable  x  where  cos^  =  tanh  a,  the  solution  of  (17)  can  be  easily 
obtained  as 


/  =  constant  x  sin"  (x)  CnN+_lJ2  (cos  x), 


(19) 


where  the  integer  N  is  related  to  the  eigenvalue  X  by 


X 

m2(  1  —  AA) 


N(JV  +  1). 


(20) 


This  result  is  to  be  compared  with  the  eigen  spectrum  of  X  when  the  effect  of  self-energy 
is  not  present,  which  is  obtained  from  (20)  by  setting  A  =  0  on  the  left-hand  side.  The 
relevant  relation  is 

X/m2  =  N(N  +  \).  (21) 


Thus  when  E  =  0  and  the  scalar  particles  interact  through  ladder  diagrams  via  the 
exchange  of  massless  scalar  particle  the  self-energy  diagrams  for  A  >  0  will  diminish  the 
eigenvalue.  This  result  is  in  agreement  with  the  finding  of  Li  et  al  (1980),  who,  however, 
solved  the  problem  numerically  for  a  massive  scalar  field  exchange.  The  eigenfunctions 
in  the  present  case  when  we  consider  the  massless  scalar  particle  exchange  remain 
identical  both  for  equations  with  or  without  the  inclusion  of  self-energy  effects.  Li  et  al 
(1980)  found  a  slight  change  in  the  nature  of  the  eigenfunctions  in  these  two  cases. 


4.  Other  remarks 

If  we  collect  the  solutions  at  E  =  Oof  g(rj)  and  /(a)  from  (18)  and  (19)  we  see  from  (13) 
that  the  Bethe-Salpeter  wave-function  \j/  takes  the  form 

*  YNnlm  (22) 

where  YNnlm  are  the  spherical  harmonics  in  the  5-dimension  euclidean  space  and  are 
given  by 
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YNnim  =  (sin  x)nCnN+fn12  (cos x)(sin  rj)1  + 1  Cj,*  {_  x  (cos  r])-Ylm{6 ,  (p),  (23) 

where  N  >  n  >  l  +  1,  l  >  \m\  and  the  degeneracy  of  YNnlm  is  ^N(N  + 1)(2 N  +  1).  In 
other  words  the  bs  equation  even  in  the  presence  of  self-energy  effects  preserves  the  0(5) 
symmetry  (Biswas  1967)  originally  discovered  by  Cutkosky  (1954)  when  the  self-energy 
effect  was  not  included. 

At  E  f  0  the  energy-eigenvalue  equation  (17)  can  be  recast  in  the  following  form 


,  d2G  dG 

(\-Z2)^  +  2(n-l)Z-^  + 


Wm2) 


1 


dZJ 


d Z  '  1(1  -E2  +  E2Z2)  (1  —  2A(m,  g,  Z2)) 


Equation  (20)  reduces  to  Cutkosky’s  (1954)  equation  when  A  is  set  to  zero.  The 
equation  cannot  be  solved  exactly  except  at  special  limits.  Both  the  eigenfunctions  and 
eigenvalues  will  be  largely  altered  due  to  the  presence  of  the  factor  [1  —  2A(Z2)]  " 1  in 
the  third  term  which  makes  the  equation  more  singular  than  the  standard  Huen’s 
equation  (Erdelyi  1942,  1944). 
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Abstract.  In  a  recently  suggested  variational  quantum  field  theoretical  approach  the  angular 
momentum  and  isospin  properties  of  the  pion  field  surrounding  a  quark  bag  are  investigated 
using  the  Lagrangian  of  the  Cloudy  Bag  Model. 
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1.  Introduction 

In  the  last  few  years  various  methods  have  become  popular  in  order  to  solve  chiral  bag. 
chiral  solition  and  Skyrme  models  for  the  nucleon  and  delta  system.  Most  of  these 
procedures  (Chodos  and  Thorn  1975;  Brown  and  Rho  1979;  Vento  e:  al  1980;  Birse  and 
Banerjee  1984;  Kalbermann  and  Eisenberg  1984;  Jackson  and  Rho  1983;  Adkins  e t  al 
1983)  seek  solutions  assuming  from  the  start  the  so-called  hedgehog  form  (Skyrme 
1962)  because  the  formal  simplification  allows  an  exact  nonperturbative  treatment  of 
these  models  in  the  mean  field  approximation.  Actually  in  a  recent  paper  some 
justification  of  the  hedgehog  assumption  has  been  given  (Urbano  and  Goeke  1984c). 
There  it  has  been  shown  that  the  hedgehog  is  one  of  a  set  of  shapes  of  the  pion  mean 
field,  coupled  to  the  corresponding  quark  structure,  which  minimizes  the  energy  of  the 
Cloudy  Bag  Hamiltonian  in  the  nucleon  delta  sector.  The  proof  has  been  obtained  in  a 
recently  formulated  variational  quantum  field  theoretical  approach  (vqf),  whose 
details  can  be  found  in  Urbano  and  Goeke  (1984a,  b).  Unfortunately,  the  hedgehog 
approximation  for  the  bag-pion  system  violates  rotational  and  isospin  symmetries. 
This  means  that  the  corresponding  state  is  not  an  eigenstate  of  the  operators  of  total 
angular  momentum  J  and  total  isospin  T.  Thus  it  is  important  to  investigate  some  of  the 
J-  and  T-properties  of  the  hedgehog  and  related  solutions  of  the  chiral  bag  system.  This 
is  the  objective  of  this  paper  and  the  formalism  used  is  the  VQF-theory. 

The  paper  is  constructed  as  follows.  Section  2  reviews  the  VQF-theory  with  particular 
emphasis  on  its  mean  field  aspects,  §3  investigates  some  angular  momentum  and 
isospin  properties  of  the  hedgehog  solution.  The  corresponding  projection  formalism  is 
developed  in  §4  with  some  numerical  results  for  simplified  quark-pion  structures.  A 
summary  and  an  outlook  are  given  in  §  5. 


2.  The  VQF-theory 

The  detailed  formulation  of  the  vQF-approach  can  be  found  in  Urbano  and  Goeke 
( 1 984a.  b).  In  the  present  section  we  review  briefly  the  formalism  as  far  as  it  is  needed  for 
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the  following.  All  considerations  are  presently  done  using  the  Lagrangian  of  the  Cloudy 
Bag  Model  (Theberge  et  al  1980).  This  one  is  written  in  the  form 


H 


(r)a •  V qa(r)  +  B  ]0V 


+  £  J'd3kco(/c)a/  (k)aj(k) 


+ 


2/. 


d3r  £  qa{r)y5rta'  ■  <j>(T)qa(r)As 


(1) 


a  =  1 


A 

Here  <f>( r)  is  the  quantized  pion  field  operator 


4>M  = 


1 


(2n) 


3/2 


d3k 


V/2w(k) 


[u7(k)exp  (ik  •  r)  +  aj  ( k )  exp  ( —  ik  •  r)] 


(2) 


and  the  af  (k)[a,(k)]  create  (annihilate)  a  pion  with  momentum  k  and  cartesian  isospin 
component  j.  The  qa{r)  is  the  quark  field  operator,  where  the  index  a  indicates  the 
colour. 

In  the  nucleon-delta  sector  the  quarks  occupy  the  s-state  of  the  bag  with  the  familiar 
wave  function 


'I'M 


N_/jo(Or/R) 

4k  \  ij1  (Clr/R)aia)  ■  r 


where  |  %a  )  is  the  spin-flavour  state  of  the  quark  with  colour  a.  Apparently  the  most 
general  form  of  \x„ )  in  the  u-d  sector  is  given  by 


*<.(“)>  =  N(a){ai\u,  t  >  +a2|u,  1  >+a3|d,  T  >  +  a4|d,  l  >}, 


(4) 


where  u ,  d  indicate  up-  and  down-quarks  and  the  arrows  the  orientation  of  the  spin 
component.  The  N(ot)  is  the  normalization  constant  and  the  as  are  to  be  treated  as 
variational  parameters  for  the  quark  structure  of  the  baryon.  Due  to  symmetry  the  a’s 
are  the  same  for  all  three  colours. 

The  total  baryon  is  assumed  to  consist  of  the  quark  core  given  by  (3),  and  the  pion 
cloud  described  by  a  coherent  state 


0>.  (5) 

Here  the  N({)  is  a  normalization  factor  and  the  field  amplitudes,  £,  (k),  are  to  be  treated 
as  variational  parameters  for  the  pion  cloud. 

Using  (3)  and  (4)  the  total  trial  wave  function  of  the  baryon  reads 


|{>  =  ^({)exp 


I  d3/c£,(k)a/  (k) 


_  J 


!*(«.  o>  = 


For  a  fully  quantum  mechanical  treatment  the  |^(a,  £)>  has  to  be  projected  on  good 
spin  and  isospin  quantum  numbers: 

PjPj] *(«,  «>  _ 

<•/'(“> £)\PjPt\'H<x>  i))112  ’ 


I  >PjA<*-  &  >  = 


(7) 
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Here  Pj  and  PT  are  projection  operators  introduced  into  nuclear  physics  by  Peierls  and 
Yoccoz.  For  a  general  intrinsic  solution  (6)  with  eg  the  spin-flavour  structure  (4)  these 
projections  are  complicated  since  no  simplifying  symmetry  relations  can  be  used.  Thus 
projection  techniques  are  presently  only  used  for  special  spin-flavour  configurations  as 
reported  in  Urbano  and  Goeke  (1984b)  and  in  §5.  Here  we  first  investigate  the 
properties  of  the  intrinsic  solution  corresponding  to  the  mean  field  approximation. 


3.  Mean  field  approximation 


In  the  mean  field  approximation  the  energy  is  given  by  the  expectation  value  of  H 
between  the  states  (6),  yielding 

3  r 


E(a,Z)  =  EMn+  £ 


d3fc<o(fc){;(kK;(k) 


i  =  i  J 


3  r 

+  9  1 

j  =  1  « 


d3fc 

y/2  «B(k) 


ip{k)ij(k)  £  <  Xa \°{a)  ■  k  la  >  +  c.c 


a  =  1 


where  £MIT  =  3Q/R  +  4nB/?>R3  -Z/R  is  the  MiT-model  energy,  p{k)  =  3jfkR)/kR  is 
the  Fourier  transform  of  the  pion  source  density,  and  g  =  Cl/ {6 f(Cl  —  1)(27t)3/2}  is  a 
renormalized  coupling  constant. 

The  variation  of  (8)  with  regard  to  £*(k)  yields  SE/6£f( k)  =  0  and  hence 


^(k)  -  oAk)%m  f  <X>)K> ' kTf  u°(a)>' 

(9) 

Inserting  this  into  (8)  the  total  energy  becomes 

E{ol)  =  EmT  —  6ng2A  (a) 

• 

f»fcV2(k) 

o  °>2(k) 

(10) 

with 

a 

/4(a)=  £  |<z(«)|ffiTj|z(“)>|2- 

(11) 

i,j  =  i 


The  E{ol)  has  a  minimum  whenever  A(ol)  reaches  a  maximum.  As  shown  in  Urbano  and 
Goeke  (1984c),  evaluating  A( a)  using  (4)  yields 

A(a)  =  3-2N4(a){(|ai|2  +  |a2|2-|a3|2-|a4|2)2  +  4|a1af  +a2aj|2}. 

(12) 

One  sees  immediately  that  /((a)  <  3.  The  maximum  value  of  /4(a)  is  reached  taking,  for 
instance,  a,  =  a4  =  0  and  a2  =  -a3  =  1.  This  particular  choice  corresponds  to  the 
quark  spin-isospin  state 


Xh  y  ~~  ^  I Uy  ^  ^  1  ^  M ’ 


(13) 


which  is  the  well-known  hedgehog  quark  state  considered  in  Chodos  and  Thorn  ( 1 975), 
Brown  and  Rho  (1979),  Vento  et  al  (1980),  Birse  and  Banerjee  (1984)  and  Kalbermann 

and  Eisenberg  (1984). 
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Inserting  (13)  into  (9)  yields 

.  6gp(k) 


£»(k)  =  -i 


(2w(k))3/2 

For  the  corresponding  pion  mean  field, 
one  obtains  after  an  explicit  calculation 


(kx$j  i  4-  ky5 j2  +  kzSJ3). 


(14) 


(15) 


4> i (r)  =  -  G(r)  <j>2( r)  =  -  G(r)  <j>} (r)  =  -  G(r), 

r  v  v 


with 


G(r)  =  3<7(2/7t)1/2 


3  P(k) 


dkk' 


[co(k)]: 


h  (kr). 


(16) 


(17) 


Both  (13)  and  (16)  show  the  well-known  and  always  assumed  hedgehog  properties. 
Apparently  they  arise  from  the  variational  principle  formulated  above,  ie  vqf. 


4.  Angular  momentum  and  isospin  properties  of  the  hedgehog  solution 

The  hedgehog  solution  |  iph  )  with  %  of  (13)  and  £  of  (14)  satisfies  for  i  —  x,  y,  z  the 
equations 

(JW +P?)\4>„>  =  0  (18) 


and 


(jj->+jr))i*»>“<>. 


(19) 


The  first  one  (18)  is  rather  trivial  to  show,  if  (13)  is  considered.  The  second  one  (19)  is 
more  difficult  and  will  be  shown  explicitly  in  this  section. 

The  expansion  of  the  coherent  state  \£h )  in  terms  of  zero  boson,  one-boson,  two- 
boson  states,  etc.,  yields 

K»>  =  °  +  S  |d3^(k)a;(k)|0>  +  ix  fd3k  fd3fc'^(k)^.(k') 

j  J  z •  jf  J  J 

af  (k )ay  (k')|0)  4-  .... 

Due  to  the  structure  of  the  coherent  state  \£h )  it  is  sufficient  to  show  that 


(j(*)+r(*))£  dX(kH+(k)|0>  =  0, 


(20) 


In  a  cartesian  three-dimensional  representation  one  has  for  the  isospin  operators  the 
matrices 

0  0  0\  /  0  0  i 

T[n)  =  [0  0  -i  T(2n)  =  |  0  0  0 


0  i  0 

If  one  writes  af  (k)|0)  =  | k, jf  >  and 


|M>  =  (  0  )|k> 


—  i  0  0 


O' 


/ 


0  -i  0 

r?)  =  (  i  oo). 
o  oo 


=  (  i  ||k> 


k,  3  > 


O' 

0  1 1  k  >, 
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one  obtains  immediately 

r(1”'|k,l>=0  7Y,|k,2>  =  i|k,3>  7Y»|k,3>= -/(k,2> 

7lTlik)l>=  -i|k,3>  r(2”)|k,2>  =  0  7^'  |  k,  3  >  =  i|k,  1  > 

n',|k,l>  =  i|k,2>  TYMk,  2  >  =  —  i|k,  1  >  7t’|k,3>  =  0. 

Thus  one  gets  eg  with  (14) 

6  gp(k) 


T\ 


d3^1(k)a1+(k)|0>  = 


[2co(fc)]3/2  1  "> 

The  operator  Jl  can  generally  be  written  as 


{-kf  k,3>  +  /c2  k,2>}d3k.  (21) 


Jk=  Z  %mn<LP 


n  > 


(22) 


m,  n 


where  q  and  p  are  coordinate  and  momentum  operators.  This  yields  in  the  momentum 
representation  for  example 


Jl  —  i  |d3fc|k> -^-/cz<k|-i  d3/c|k >  ~fc,<k|. 


(23) 


Hence  one  obtains 


Jni 


d3k’{l(k')al  (k)|0>  =  i  6g 


d  3k' 


P(k') 


[2g)(/c')]3/2 

x  fd3lc|k>^-fcz{-A:;<k|k',  l>-/c;<k|k',2>-fc;<k|k',3>} 


d3fc|k>  —  ky{-k’x(k\k',  i>-k;<k|k\2>-fc'<k|k\3>}, 
ok - 


giving 


Jl 


d3k'  lh(k')al  (k')|0>  =  -6  g 


[2<n(k)]3/2 

{P  f)  S 

k,  1  >i  —  kz(  —  kx)+  | 2>  —  kz{  —  ky)  +  \k,  3  >  kz(-kz) 

~  k'1>^:fcr(-kx)-|k-2>^/cy(-k,)-|k,3>  — ^,(-^)|. 
The  explicit  evaluation  yields  now 

Jl  \d3k'Z1(k')at(k')0  =  6g 


r  3,  pW 


(24) 

Comparison  of  (24)  with  (21)  shows  that  we  have  proven  the  assertion  for  the 
considered  special  case  i  =  1  in  (10).  The  proof  for  the  other  cases  is  similar. 

From  (8)  and  (9)  there  follows  the  important  fact  that  a  decomposition  of  the  total 
hedgehog  state  |iA„>  in  terms  of  states  with  good  spin  J  and  isospin  T  yields 


\<l'k>  =  Iaj\J’T=J> 


(25) 
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Thus,  in  order  to  include  quantum  correlations,  only  a  projection  on  one  of  the 
quantum  numbers,  involving  only  one  set  of  Euler  angles,  is  sufficient  for  a  hedgehog 
state.  However,  this  simplification  does  not  hold  if  one  first  projects  on  states  J,  T  and 
then  performs  a  variation.  In  such  a  case  there  is  no  reason  why  |i p}  should  not  have 
components  with  J  ^T. 


5.  Projection  techniques 


A  proper  way  to  proceed  would  be  now  to  perform,  first,  a  projection  on  good  J-  andT- 
quantum  numbers  according  to  (7)  and  second,  to  vary  the  corresponding  energy  into 
its  minimum.  This  is  a  rather  complicated  procedure  which  has  not  been  investigated 
yet.  A  simpler  way  would  consist  in  performing  the  projections  after  the  variation,  ie  to 
perform  a  projection  of  a  hedgehog  state.  Even  this  is  complicated  since  a  hedgehog 
state  has  nonaxial  components  in  spin  and  isospin  space.  In  order  to  explore  the 
projection  techniques  a  simpler  case  is  considered  here,  where  the  intrinsic  three-quark 
states  are  coupled  (Urbano  and  Goeke  1984b;  Theberge  et  al  1980)  to  the  ground  state 
of  a  proton  with  spin  up,  \Nf/2  )  mixed  with  a  state  |  Af/2  ).  Thus  the  bare  nucleon  state 
is  assumed  to  be 


BN  )  =  cosa  NiI2  )  +sina  A1/2  ). 


For  such  a  state  one  obtains  the  energy 


dp  sin  P 


o 


dp  sin  pdJl/2  i/2(/?)dT/2  1/2 (P)h(P>  P) 


0 


EJT{ot,  R)  — 

dp  sin p  I  dpsinpdJl/21,2{P)dTl/2l/2{p)n{P,P) 

Jo  Jo 

with  the  overlap  kernels 

h(oc;P,P)  =  <BN|<£|Hexp{  ipjy  +  ijj  J3 }  |  BN  >  |  £  >, 
n(z;  P,p)  =  <BN|<£|exp  {ipjy  +  ipJ3 } | BN > | { >• 

For  the  evaluation  of  (28)  it  is  necessary  to  know  the  rotated  pion  field  state 


(26) 


(27) 


(28) 


exp  {ifiJy  +  ijjj 3 }  |0  =  —  exp<[  £  j  d3fc  ^(kj;  pf!)aj  (k)  [  |0  >, 


with 


«AW)  =  I[«3(/5)LO  (V(/w> 

j' 

where  N  is  a  normalization  constant.  Following  the  formalism  of  Urbano  and  Goeke 
(1984a)  some  results  of  the  calculations  are  shown  in  figure  1.  There  the  MiT-energy,  the 
intrinsic  energy  and  the  projected  energy  are  contrasted.  One  realizes  that  the  effect  of 
the  projection  is  biggest  at  small  bag  radii  where  the  pion  field  is  strongest.  Actually  for 
very  small  K-values  one  would  obtain  a  collapse  of  the  system  if  this  is  not  prevented  by 
an  additional  mechanism.  In  the  present  investigation  this  is  done  by  assuming  the  qq- 
component  of  the  pion  to  have. a  finite  extension  (Urbano  and  Goeke  1984b) 
characterized  by  its  radius  rjn.  For  a  physical  pion  the  rjn  is  related  to  the  pion  decay 
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Radius  of  the  Bag  [fm] 

Figure  1.  The  total  nucleon  energy  is  given  for  various  bag  radii  R.  Considered  are  the  mit- 
model,  the  VQF-formalism  in  the  mean  field  approximation  (intrinsic)  and  with  angular 
momentum  and  isospin  projections  included  (projected).  For  the  last  case  rjn  =  0- 1 7  fm  and  t]n 
=  0  fm  are  considered  to  demonstrate  the  instability  for  small  radii  for  vanishing  t]n.  The 
parameters  of  the  calculation  are  B1/4  =  0125  GeV,  g  =  14GeV-1,  Z  =  1-75. 


-60  -30  0  30  60 

Mixing  Angle  a  [deg] 


Figure  2.  The  average  number  of  pions  in  the  intrinsic  nucleon  state  is  given  in  dependence 
on  the  mixing  angle  a  for  various  bag  radii  R.  The  parameters  of  the  calculation  are  those  of 
figure  1. 

constant  and  comes  out  to  be  >u  =  017  fm.  This  value  is  used  in  the  calculations. 
An  interesting  quantity  is  the  average  number  of  pions  in  the  cloud.  This  is  given  by 

N„=<£|£  d3fca/  (k)flj(k)|^  >, 
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and  for  the  projected  states  analogously.  Some  results  are  found  in  figures  2  and  3. 
Figure  2  shows  for  various  /^-values  the  intrinsic  number  of  pions  in  dependence  on  the 
mixing  angle  a.  One  sees  a  clear  variation  showing  maximal  values  at  a  around  40°.  For 
bag  radii  of  about  1-2  fm  only  small  values  of  Nn  are  obtained  which  are  at  most 
Nn  %  0-2.  The  projected  results  are  different.  They  are  displayed  in  figure  3  for  various 
f/^-values  in  dependence  of  R.  A  comparison  between  both  figures  shows  that  the 
projected  values  are  noticeably  larger  than  the  nonprojected  ones.  This  holds  even  for 


Figure  3.  The  average  number  of  pions  in  the  projected  nucleon  state  is  given  in  dependence 
on  the  bag  radius  R  for  various  mixing  angles  a.  The  parameters  of  the  calculation  are  those  of 
figure  1. 


Figure  4.  The  self-energy  the  bag  acquires  due  to  the  coupling  with  the  poin  field. 
Considered  is  the  intrinsic  solution  with  the  parameters  of  figure  1  for  various  a-  and  /^-values. 
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Radius  of  the  Bag  [fm] 

Figure  5.  The  self-energy  of  the  projected  state  for  various  a-  and  R-values.  The  parameters 
are  taken  from  figure  1. 


a  =  0°  and  R  ~  1  -2  fm  where  the  effect  of  the  projections  on  the  energy  is  rather  smaH. 

The  self-energy  of  the  system  shows  a  behaviour  as  it  is  displayed  in  figures  4  and  5.  It 
is  noticeably  larger  in  magnitude  for  the  projected  case.  For  all  the  above  data  it  should 
be  noted  that  due  to  the  finite  pion  size  the  effective  coupling  constant  goes  to  zero  for 
R-+  0. 


6.  Summary 

The  objective  of  the  present  investigation  was  to  illuminate  angular  momentum  and 
isospin  properties  of  chiral  bag  models.  To  this  end  the  variational  quantum  field 
theoretical  approach  (vqf)  of  Urbano  and  Goeke  was  considered.  It  was  first 
demonstrated  that  the  well  known  hedgehog  structure  appears  by  variational 
techniques  if  a  Cloudy  Bag  Hamiltonian  is  used.  It  was  then  shown  that  this  hedgehog 
allows  for  simplifications  of  the  projection  techniques  to  be  used.  Those  were  then 
studied  in  a  simplified  model  showing  that  particularly  the  average  number  of  pions  in 
the  field  is  a  rather  sensitive  number.  Stability  of  the  system  can  be  guaranteed  if  a  finite 
size  of  the  pions  in  the  cloud  is  assumed. 
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Relativistic  harmonic  confinement  of  quarks  and  gluons 
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Physical  Research  Laboratory,  Navrangpura,  Ahmedabad  380009,  India 

Abstract.  A  current  confinement  model  for  glue-balls  is  proposed  to  maintain  complete 
similarity  with  relativistic  harmonic  oscillator  model  (rhm)  with  Lorentz  scalar  and  vector 
potentials  for  quarks.  The  spurious  motion  of  the  centre  of  confinement  is  accounted  for  in 
these  models  in  exactly  the  same  manner.  We  shall  review  some  of  the  successes  of  the  rhm  in 
hardronic  masses,  magnetic  moments  of  baryons,  nucleonic  sizes  and  polarisability  and 
proton-antiproton  annihilation,  as  well  as  the  recent  results  on  glue-balls. 

Keywords.  Relativistic  harmonic  confinement;  quarks;  gluons;  glue-balls;  hadrons;  masses; 
magnetic  moments. 


PACS  No. 


1.  Introduction 

It  is  now  accepted  that  strongly  interacting  particles  i.e.  hadrons  consist  of  quarks  and 
gluons  and  the  dynamics  of  their  ‘colour’  is  quantum-chromo-dynamics  (qcd).  The  qcd 
is  asymptotically  free  and  one  expects  it  to  show  infrared  slavery  or  confinement  in 
accord  with  the  empirical  fact  that  only  colour  singlet  objects  are  seen  free.  Starting 
from  a  confinement  model  one  can  derive  properties  of  hadrons  and  this  was  the 
philosophy  of  early  bag  models  (Bogolinkov  1967,  1972;  Chodos  et  al  1974).  These 
models  though  relativistically  suffered  from  sharp  surface  and  spurious  motions  of  the 
centre  of  confinement  (cm).  The  non-relativistic  constituent  quark  models  (Isgur  and 
Carl  1977,  1978,  1979a,b)  are  not  consistent  though  they  could  explain  many  aspects  of 
hadron-spectroscopy.  The  relativistic  harmonic  oscillator  (Lorentz  scalar  plus  vector) 
model  (rhm)  (Khadkikar  and  Gupta  1983)  combined  the  desirable  features  of  both 
and  was  eminently  successful  in  explaining  very  diverse  aspects  of  hadron  spectroscopy 
and  magnetic  moments,  nucleon  polarisability,  nucleon  anti-nucleon  annihilation,  etc. 
We  shall  review  some  of  the  results  in  the  following,  qcd  predicts  that  there  should  be 
new  kinds  of  particles — colour  singlet  states  of  two  or  more  gluons-  glue-balls  and 
hybrids  of  quarks  and  gluons — meiktons.  Here  we  shall  propose  a  particular  scheme 
for  confinement  of  gluons  on  similar  lines  to  rhm  for  quarks.  We  shall  then  compare  the 
results  to  experimental  candidates  for  glue-ball  states.  In  the  end  we  shall  present  an 
outlook  for  future  work. 


2.  rhm  for  hadrons  with  ‘interaction’  effects 

The  rhm  for  light  hadrons  with  a  constant  effective  one  gluon  exchange  hyperfine 
interaction  (Khadkikar  and  Gupta  1983)  gave  quite  an  impressive  account  of  hadronic 
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masses  and  baryonic  magnetic  moments.  However  there  was  the  problem  of  the  pion 
being  too  heavy.  Also  the  magnetic  moments  though  better  than  other  model 
calculations  were  not  in  precise  agreement  with  experimental  values.  This  meant  that 
the  additional  interaction  effects  on  the  magnetic  moments  which  were  neglected  are 
not  insignificant.  By  using  group  theoretic  methods  one  can  analyse  the  contribution  to 
the  magnetic  moments  in  terms  of  unit  tensors  of  (SU3  x  SU3)  flavour  group.  Indeed  a 
single  tensor  was  found  so  that  the  discrepancy  of  the  rhm  moments  from  experimental 
values  could  be  removed.  The  nature  of  the  tensor  is  such  as  to  satisfy  Marshak-Okubo- 
Sudarshan  sum  rule  and  the  trace  of  the  magnetic  moment,  over  the  flavour  octet  is 
nearly  zero.  The  results  obtained  (Gupta  and  Khadkikar  1984)  are  given  in  table  1. 


Table  1.  Magnetic  moments  of  octet  baryons  with  ^-moments. 


Baryons 

rhm  +  T 

Experimental 

Fit 

P 

2-759 

2-7928456 

2-793 

n 

— 1-894 

— 1-91304184 

—  1  913 

A 

-0-572 

-0-613±0.004 

-0-614 

I  + 

2-335 

2-38±0-02 

2-388 

1° 

0-605 

— 

0-633 

I" 

-1-126 

—  1  1 1 1 10.033 

-1-123 

s° 

— 1-177 

-1-25010-014 

-1-245 

M  ~ 

\mmd 

-0-637 

-0-6910.04 

-0-685 

A-I° 

-1-499 

-1-82  +  0-26 

-1-520 

-0-18 

The  interaction  effects  are  simulated  by  a  tensor  in  isospin  T  = 

d  I(2I-1)  +  4I3  (I 

—  1),  d  is  the  only  parameter.  We  also  show  a  fit  when  the  three  quark  magnetic 

moments  are  treated  as  free  parameters. 

Table  2.  Calculated  masses  of  light  hadrons 
with  renormalised  strength  in  (n,  K)  systems 
(Khadkikar  and  Gupta  1984)  compared  with 
observations. 

Hadron 

Observed  (MeV) 

(MeV) 

pin 

939 

939 

A 

1116 

1115 

I 

1189 

1181 

M 

1321 

1330 

A 

1232 

1232 

I* 

1385 

1375 

E* 

1533 

1523 

Q‘ 

1672 

1678 

P 

770 

745 

K* 

892 

901 

CD 

783 

745 

<t> 

1019 

1068 

n 

139 

135 

K 

495 

497 
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The  other  problem  of  the  pion  being  rather  heavy  arises  because  the  one  gluon- 
hyperfine  interaction  is  treated  perturbatively  and  because  the  interaction  is  attractive 
in  7i  and  p  one  can  show  that  the  diagonalisation  of  the  interaction  matrix  in  rhm  states 
leads  to  the  lowering  of  the  n  and  p  ground  states  so  as  to  agree  with  observed  masses 
(Gupta  and  Khadkikar  1984,  table  2). 

Considering  the  successes  of  rhm  which  we  shall  further  discuss  later,  it  is  tempting  to 
construct  a  gluon  confinement  model  on  parallel  lines.  One  should  keep  in  mind  that 
the  success  of  rhm  was  closely  linked  with  the  accounting  for  spurious  cm. 


3.  A  confinement  model  for  glue-balls 

qcd  predicts  the  existence  of  gluonic  bound  colour  singlet  states  called  glue-balls.  For, 
short  of  any  exact  solutions  of  qcd  as  in  the  case  of  hadrons,  one  can  proceed  to  make 
models  with  confinement  as  the  prime  criterion.  Dielectric  confinement  models  have 
been  in  vogue  e.g.  see  Lee  1981).  In  the  harmonic  confinement  scheme,  however,  the 
dielectric  confinement  alone  seems  to  generate  tremendous  complications  especially  for 
the  colour  electric  modes  (Khadkikar  and  Vinodkumar  1984)  for  want  of  a  suitable 
gauge  choice.  The  simplicity  is  recovered  though  if  one  assumes  that  the  confinement  is 
due  to  current  ( J)  which  is  proportional  to  the  ‘gluon’  vector  field  (A)  for  each  colour, 

J  =  —a2r2  A,  (1) 

where  r  is  position  vector  from  the  centre  of  confinement.  It  is  now  straightforward  to 
write  down  Maxwell’s  equations  with  current  source  (1)  and  solve  them  in  a  Lorentz 
gauge  to  obtain  for  A 

(  —  V2  -\-a2r2)  A  =  co2 A.  (2) 

We  have  to  still  eliminate  the  additional  (longitudinal)  degree  of  freedom  in  A.  The 
natural  choice  here  is  what  we  might  call  an  oscillator  gauge.  One  can  write  down  (2)  in 
oscillator  quantas 

(a-a+ +a+  a)  A  =  co2A,  (2a) 

where  a  =  — -r —  (V  -f  ur), 

V  2a 

and  a+  is  the  hermitean  conjugate.  The  usual  radiation  gauge  conditions  V-  A  =  0  is 
now  replaced  by 

a  ■  A  =  0  (3) 

in  the  oscillator  gauge,  implying 

V  •  A  +  ar  •  A  =  0, 
d(p 

and  —  ar  •  A  +  —  =  0, 

dt 

where  (p  is  the  coulomb  potential. 

It  is  now  seen  that  in  this  gauge  the  scalar  modes  are  nullified  by  the  radial 
component  of  the  vector  gluons.  The  electric  (E)  and  magnetic  (M)  modes  are  now 
given  by  the  conditions: 


(4) 

(4a) 


P—5 
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a+  •  A  =  0  (M),  (5) 

and  L-A  =  (a+  xa)-A  =  0  (E).  (5a) 

The  M  and  E  modes  then  are  given  in  terms  of  a  scalar  oscillator  function 
satisfying  the  eigenvalue  equation  the  same  as  A  as  follows: 

and  A£  =  (a  —  a  +  — p  a  •  a)<£(„  +  1)(m.  (6) 

a  a 

Thus  n  =  1  and  n  —  0  are  the  lowest  M  and  E  modes  respectively.  The  quantization 
procedure  can  be  then  carried  out  as  usual  with  frequencies, 

con  =  ±  (2 n  +  3)1/2  a112,  n  =  0, 1,  2,  .  .  . 

But  there  is  spurious  cm,  to  remove  which  we  follow  the  method  of  Khadkikar  and 
Gupta  (1983).  For  this  one  must  construct  A-gluon  states  with  no  quantas  in  cm 
motion.  Still  there  is  the  null  point  motion  and  its  average  is  deducted  from  0)1  for  each 
gluon.  Thus  the  intrinsic  frequencies  (energies)  are  given  by  (^  =  c  =  1): 

e„  =  ^2n  +  3-0,2a1/2.  (7) 

Thus  the  masses  of  glue-balls  can  be  calculated  using  (7)  and  the  properly  constructed 
intrinsic  states.  These  are  tabulated  (table  3).  For  comparison  the  experimental 
(Chanowitz  1 982)  candidates  are  also  shown.  We  also  show  the  results  for  trigluon  glue- 
balls  (Khadkikar  and  Vinodkumar  1984).  These  are  predictions  and  in  all  only  two 
glue-ball  candidates  are  known.  It  still  remains  to  see  the  effect  of  residual  interactions 
as  gluons  couple  to  themselves. 

It  is  clear  that  the  mere  act  of  confinement  is  able  to  give  plenty  of  information  of  the 
strongly  interacting  particles  and  one  must  look  for  applications  in  all  aspects.  We  shall 
present  some  of  these  in  §4. 


Table  3.  Masses  of  glue-ball  states  with  their  possible  angular  momentum  (/)  and  parity 
( P )  charge  conjugation.  C  labels  are  given  for  two-  and  three-gluon  (em)  combinations  along 
with  masses  of  two  experimental  candidates. 


Calculations  Experiment 


Modes 

JPC 

Mass  MeV 

JPC 

Mass  MeV 

EE 

0++,  2+  + 

1137 

— 

— 

EM 

0"  +  ,  2~  + 

1440 

0“  + 

1440 

MM 

0++,  2+  + 

1703 

2+  + 

1700  ±  50 

EEE 

0"+,  2-+,  3” 

1971 

— 

— 

EEM 

0++,  2++,  l  +  ~,  3  +  ~ 

2296 

— 

— 

EMM 

o~+,2-+,  r-,3-- 

2489 

— 

— 

MMM 

0++,  2++,  l  +  ~,  3  +  - 

2720 

— 

— 
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Table  4.  pp  annihilation  in  annihilation  (^/rearrangement  ( B )  models  at  rest.  Branching  to 
some  channels  (Furui  et  al  1984). 


0 

7t  +  71 

a 

+ 

i 

a 

© 

71 +  71  X° 

2n  +  2n 

2n  +  2n  7i° 

2n+2n  x° 

A 

2-18 

0 

5-86 

34-93 

0-69 

19-57 

19-88 

B 

Exper- 

2-0 

0 

5-84 

33-14 

1-76 

17-01 

23-14 

imental 

41 

0-37  ±0-3 

6-9  +  0-35 

35-4  ±6-7 

6-9  +  06 

19-6  +  0-7 

20-8  +  2 

4.  Some  applications  of  rhm 

The  relativistic  wavefunctions  in  rhm  can  be  transformed  to  non-relativistic  ones  with 
corresponding  transformed  operators  using  Foldy-Wouthuysen  transformation. 
Hence  all  phenonemonology  with  constituent  quarks  remains  essentially  valid.  This 
was  especially  verified  in  the  case  of  baryonic  magnetic  moments. 

The  proton-polarisability  calculations  eliminating  spurious  cm  and  rhm  for  quarks 
(Kuyucak  1984)  has  given  much  better  agreement  with  experimental  parameters  than 
bag  models  or  the  non-relativistic  oscillator  model  which  over  estimates  by  a  factor  of 
3/2.  Moreover,  rhm  is  the  only  model  which  has  internal  consistency  in  that  it  gives 
[  <r2  }ch~\l/2  and  polarisability  sum  rule  (0*9 1  /,  1 2*7  10“ 53  cm3)  in  agreement  with  the 
experimental  values  (0-88/,  14-2  +  03  10~43cm3). 

The  proton-antiproton  annihilation  branching  ratios  to  various  mesons  represent 
yet  another  application  of  the  confinement  model.  While  it  was  found  that  rhm  is 
superior  to  other  similar  models  in  3-meson  decays,  the  2-meson  decays  representing 
annihilation  of  a  qq  pair  to  vacuum,  presents  some  uncertainty  (Furui  et  al  1984).  The 
calculated  branching  ratios  in  some  channels,  however,  are  in  remarkable  agreement 
with  experimental  ones  (table  4). 

Several  more  applications  such  as  effective  quarkpion  coupling,  pion-nucleon 
coupling,  nucleon-nucleon  force  and  exotics  such  as  hybrid  meiktons  using  rhm  and  the 
gluon  confinement  scheme  as  given  here  are  under  way. 


5.  Outlook 

It  appears  that  the  traditional  picture  of  point  hadrons  with  mutual  couplings  has  been 
replaced  by  a  few  gluon-quark  degrees  of  freedom.  This,  however,  throws  a  new 
challenge  to  understand  the  nucleus  from  the  ‘quark’  point  of  view.  The  conventional 
independent  particle  model  has  to  be  explained.  However,  the  limitations  of 
independent  particle/pair  picture  have  been  quite  obvious,  in  the  decade  long 
investigations  in  nuclear  matter.  Perhaps  exchange  forces  of  a  molecular  type  and  a 
geometric  model  point  of  view  expressed  ( e.g .  see  Ramanna  and  Raghavacharyulu  1975 
and  references  therein)  represent  premonition  of  the  radical  change  in  understanding  of 
the  nuclear  structure  which  is  yet  so  incomplete  on  a  fundamental  level. 


68 


S  B  Khadkikar 


References 


Bogoliubov  P  N  1967  Ann.  Inst.  Henri  Poincare  A8  163 
Bogoliubov  P  N  1972  Sov.  J.  Part.  Nucl.  3  71 

Chanowitz  M  1982  AIP  Conf.  1981  ed.  C  A  Heusch  and  W  T  Kirk  (AIP,  New  York) 

Chodos  A  et  al  1974  Phys.  Rev.  DIO  2599 

Furui  S,  Faessler  A  and  Khadkikar  S  B  1984  Nucl.  Phys.  A424  495 

Gupta  S  K  and  Khadkikar  S  B  1984  (to  be  published)  High  energy  Physics  Conf.  Jammu 

Isgur  N  and  Carl  G  1977  Phys.  Lett.  B72  109 

Isgur  N  and  Carl  G  1978  Phys.  Lett.  B74  353 

Isgur  N  and  Carl  G  1979a  Phys.  Rev.  D18  4187 

Isgur  N  and  Carl  G  1979b  Phys.  Rev.  D20  1191 

Khadkikar  S  B  and  Gupta  S  K  1983  Phys.  Lett.  B124  523 

Khadkikar  S  B  and  Vinodkumar  P  C  1984  Int.  Conf.  Nucl.  Phys.  Bombay  (submitted) 

Kuyucak  S  1984  Nucleon  polarisability  in  ‘ The  Quark  models ”  Preprint 

Lee  T  D  1981  Particle  physics  and  introduction  to  field  theory  (New  York:  Harwood  Academy) 

Ramanna  R  and  Raghavacharyulu  I V  V  1975  The  general  many  body  problem  (eds)  R  Pratap  and  J  C  Parikh 
p.  156 


Pramana,  Vol.  24,  Nos  1  &  2,  January  &  February  1985,  pp.  69-76.  ©  Printed  in  India. 


Isobaric  degrees  of  freedom  in  nuclei  as  determined  from  nonrela- 
tivistic  quark  model 

C  S  WARKE 

Tata  Institute  of  Fundamental  Research,  Homi  Bhabha  Road,  Bombay  400005,  India 

Abstract.  Isobaric  degrees  of  freedom  AA  in  nuclei  are  determined  from  the  quark  cluster 
model  of  a  nucleus.  These  additional  degrees  of  freedom  are  brought  in  by  the  coloured  quark 
exchange  between  different  nucleon  clusters  present  in  nuclei.  They  are  found  to  be  important 
in  the  region  of  momentum  transfer  near  3-5  fm'1.  The  mass  dependence  of  these  isobaric 
degrees  of  freedom  in  nuclei  turns  out  to  be  A516. 

Keywords.  Quark  model;  nuclear  quark  cluster  model;  closed  shell  nuclei;  baryon  spectra. 

PACS  No.  12  35;  12  40;  21  65 

1.  Introduction 

The  quark  models  based  on  predictions  of  quantum  chromodynamics  theory  have  been 
quite  successful  in  explaining  a  large  body  of  data  on  static  properties  of  hadrons 
(Chodos  et  al  1974;  De  Rujula  et  al  1975;  Dalitz  1977).  These  models  of  strong  nuclear 
interactions  provide  a  natural  framework  for  understanding  some  aspects  of  nuclear 
physics  which  were  either  ad  hoc  or  were  based  on  flimsy  grounds,  examples  of  which 
are  nuclear  short  distance  repulsion,  the  high  momentum  components  of  the  elastic  and 
inelastic  form  factors  of  nuclei  and  the  isobaric  components  of  the  nuclear  wave 
function.  Recently  Jaffe  et  al  1984;  Close  et  al  1983  have  argued  that  European  Muon 
Collaboration  (emc)  effect  may  be  due  to  the  nuclear  environmental  effect  on  the  quark 
structure  of  a  nucleon.  They  give  the  mass  dependence  of  this  effect.  Some  of  the 
problems  have  been  tackled  in  the  recent  past  (Matveev  and  Sorba  1977;  Hagaasen  et  al 
1980;  Smirnov  and  Tchuvilsky  1978;  Warke  et  al  1981)  in  the  six  quark  system  within 
the  framework  of  different  models.  In  this  paper  we  use  the  nonrelativistic  quark  model 
to  describe  nuclei  as  clusters  of  three-quark  baryons.  We  further  assume  that  the  quark 
structure  of  nucleons  in  a  nucleus  does  not  change.  However,  the  colour  quark 
exchange  takes  place  between  different  nucleon  clusters  in  a  nucleus.  This  quark 
exchange  generates  the  short  range  nucleon-nucleon  correlations  in  the  nucleus,  which 
is  vital  for  any  inter-nucleon  contribution  to  the  large  momentum  transfer  phenom¬ 
enon  or  short-range  effects  in  a  nucleus.  This  contribution  is  over  and  above  that  of  the 
individual  nucleons  alone.  Based  on  this  picture,  we  develop  here  a  formalism  for  the 
isobaric  and  hidden  colour  degrees  of  freedom  in  nuclei.  We  study  its  mass  number 
dependence  and  correlate  it  to  the  emc  effect  and  its  mass  dependence. 

2.  Nuclear  quark  model  and  an  isobaric  component  of  nuclear  wavefunction 

Let  us  consider  a  nucleus  of  mass  number  A  and  atomic  number  Z  whose  number  of 
constituent  quarks  are  3 A.  We  assume  that  baryon  structure  does  not  change 
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appreciably  inside  the  nucleus.  As  observed  in  our  earlier  studies  (Shanker  and  Warke 
1980a,  b;  Warke  and  Shanker  1979,  1980)  the  short  range  correlation  effects  between 
nucleons  in  a  nucleus  arise  naturally  from  the  quark  exchange  mechanism  between 
different  nucleon  clusters.  The  nth  nucleon  3-quark  cluster  wave  function  is 

i//<",(im,  \t ,  a)  =  Q|n)(3n  — 2,  3n-  1,  3 n)4m,i< 

x  (3m  —  2,  3w  —  1,  3n)t/'J,'"  (R„).  (1) 

In  (1),  x(n)  is  the  3-quark  spin-isospin  wavefunction  and  Qin)  is  their  translationally 
invariant,  correlated  space  wavefunction  which  is  determined  from  baryon  spectros¬ 
copic  studies  (Shanker  and  Warke  1980).  The  centre  of  mass  coordinate  of  3-quarks  of 
the  nth  cluster  is  Rn  and  is  the  nth  cluster  shell  model  wave  function.  Consistent  with 
the  nuclear  shell  model,  the  cluster  wave  function  is 


<P ,  i  t,  a)  =  m-n;  «-„•  (2) 

The  Pauli  principle  further  restricts  the  3 A  quark  wave  function  to  the  determinantal 
form  (Warke  1984), 


¥(1,2,  3,...;  3  A) 


The  wave  function  in  (3)  is  clearly  antisymmetric  with  respect  to  the  interchange  of 
nucleon  clusters  and  the  exchange  of  quarks  within  the  same  nucleon  cluster,  but  is  not 
antisymmetric  with  respect  to  the  interchange  of  quarks  between  different  clusters.  In 
order  to  make  it  completely  antisymmetric  in  all  the  3,4  quarks,  the  antisymmetrizer 
operator  s  is  applied  'F'4  =  CsT, 

s  =  [  l+I(-l)aPa]/N1/2. 

The  phase  (—1)“  is  the  signature  of  the  permutation  Pa.  The  normalization  factor 
N  —  (3i4)!/[/4!(3!)3].  We  observe  that  ¥  in  (3)  is  normalized  to  unity  but  s *F  is  not. 
Therefore  the  normalization  constant  C  is  introduced  in  'FA  In  writing  (3),  we  tacitly 
assumed  that  nuclei  under  consideration  are  completely  filled  shell  nuclei  and  that  the 
cluster  wave  functions  are  orthogonal.  We  further  restrict  our  study  to  nuclei  having 
N  —  Z  —  All.  In  principle,  the  nuclear  shell  model  wave  function  should  be  obtained 
from  the  nuclear  interaction  as  derived  from  the  same  quark-quark  interaction  as  that 
used  in  the  determination  of  the  3-quark  cluster  wave  function.  However,  since  such  an 
approach  gives  only  the  short  range  repulsion  of  the  nucleon-nucleon  interaction 
correctly,  this  approach  cannot  be  followed  at  this  stage  of  our  understanding. 
Therefore,  the  effect  of  long-range  interaction  is  included  phenomenologically  by 
assuming  the  nuclear  independent  particle  shell  model  of  a  nucleus.  The  oscillator 
parameter  of  this  wave  function  is  fixed  from  the  observed  nuclear  root  mean  square 
radius. 

As  each  cluster  wave  function  is  a  colour  singlet,  the  3  A  quark  wavefunction  'F  is  also 
colour  singlet.  Since  N  —  Z  and  we  are  considering  doubly  closed  shell  nuclei,  the  spin 
and  isospin  of  the  3  A  quarks  in  'F  are  zero.  The  quark  exchange  operator  is  scalar  under 
colour,  spin  and  isospin  transformations.  Therefore,  'F  A  is  also  a  colour  singlet  and  has 
zero  total  spin  and  isospin.  One  can  easily  contruct  the  following  spin,  isospin  and 
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colour  exchange  operators, 


patt)  =  P'(ij)  =  i(l  +tI-tJ-);  /*(«/)  =  i H- 2Ff •  F, 


In  the  colour  exchange  operator  P°,  F  are  the  eight  colour  SU(3)  generators.  Through 
the  antisymmetrization  process,  the  spin-isospin  exchange  operators  introduce  the 
A(3/2,  3/2)  spin-isospin  components  in  'F'1.  Similarly,  J*  introduces  the  colour  octet 
components  (hidden  colour)  in  'FA  As  'F'4  is  spin-isospin  and  a  colour  singlet  for  closed 
shell  nuclei,  this  mixing  occurs  in  AA  and  colour  8x8  pairs  to  lowest  order.  In  the 
following  we  derive  the  expressions  for  their  mixing  amplitudes. 

The  AA  wave  function  can  be  defined  analogous  to  that  for  the  nuclear  wave  function 
in  (3).  Any  two  clusters  in  'F  are  changed  to  two  delta  clusters  due  to  the  quark 
exchange.  Let  us  assume,  they  occupy  the  orbitals  <5j  and  S2  in  place  of  two  nucleons  in 
orbits  Pi  and  p2  occupied  in  *F. 


The  properly  antisymmetrized  3 A  quark  wave  function  is  =  s*FD.  Following  the 
definitions  used  in  nuclear  physics,  we  define  the  spectroscopic  factor  S  of  a  AA 
component  and  NN  component, 


S(AA,  S&;  PxPi)  =  |  <5'P0|4'^>|2  =  N\ <4'0|¥-<>| 


12. 


S(NN)  =  |<s'F|'F/1>|2  =  N|<VF|T",>|2. 


Note  that  although  XPD  does  not  have  good  spin  and  isospin,  it  is  a  colour  singlet.  The 
required  spin  and  isospin  of  X¥D  is  correctly  projected  out  in  the  overlap  amplitudes 
defined  in  (5).  Therefore,  the  complication  of  constructing  good  spin  and  isospin 
manybody  state  xi,D  is  avoided.  In  the  overlap  integrals  in  (5),  antisymmetrization 
introduces  quark  exchanges  between  A  clusters.  It  is  expected  that  quark  exchange 
probability  will  fall  faster  with  the  number  of  clusters  between  which  the  quarks  have  to 
be  exchanged.  This  is  because  quark  exchange  between  different  clusters  require  these 
clusters  to  be  overlapping.  The  probability  of  three  nucleons  overlapping  in  a  nucleus  is 
smaller  than  that  of  two  nucleons  overlapping  each  other.  We  keep  the  contribution  of 
quark  exchanges  between  two  clusters  and  neglect  that  coming  from  the  multi-cluster 
quark  exchanges  in  evaluating  the  spectroscopic  factor  in  (5).  With  this  assumption  we 
have 


XXX  Pi(m),j(n) 
m  <  n  i(m)  j(n) 


¥>. 


The  sum  over  m,  n  in  the  second  term  in  (6)  is  over  the  A  nucleon  clusters  in  a  nucleus. 
The  additional  sum  i(m)  and  j(n)  is  over  the  quarks,  the  ith  quark  in  the  mth  cluster  is 
exchanged  with  the  jth  quark  in  the  nth  cluster  j(n).  Because  of  the  spin-isospin  (3/2, 3/2) 
wave  function  orthogonality  of  delta  to  that  of  a  nucleon  (j,  f),  the  first  term  in  (6) 
vanishes.  Again  from  the  orthogonality  of  the  shell  model  single  particle  wave 
functions,  the  second  term  reduces  to 


<TC|^>=  lPi(1U(2) 


-9<IA<>V‘2>|P 


3(1),  3(2) 


Now  onwards  the  cluster  indices  1  and  2  in  (7)  can  be  dropped  since  there  is  no  possible 
ambiguity.  It  is  to  be  noted  that  the  first  term  in  (6)  will  be  unity  for  S(NN)  calculation. 
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Let  us  evaluate  separately  the  colour  exchange,  spin-isospin  exchange  and  radial 
exchange  matrix  elements  (me)  in  (7).  The  colour  singlet  wave  functions  of  two  nucleons 
are  4(1,  2,  3)  and  4(4,  5,  6)  and  the  colour  exchange  operator  is  Pc.  Using  the 
determinantal  form  of  4  the  colour  part  of  the  me  can  be  easily  calculated 


<40,  2,  3)4(4,  5,  6)  1^(16)14(4  2,  3)4(4,  5,  6)>  =  1/3.  (8) 

The  integration  over  quark  internal  coordinates  in  a  cluster  with  the  quark  space 
exchange  operator  in  (7)  requires  the  delta  space  wave  function.  We  assume  that  spatial 
quark  structure  of  the  delta  and  a  nucleon  is  the  same.  Using  the  baryon  wave  function 
derived  by  Shanker  et  al  1980a  from  the  fit  of  nucleon  spectroscopic  data,  one  has 

<6A(  1,  2,  3)Qa(4,  5,  6)  |l»(16)|fiw(l,  2,  3)fiN(4,  5,  6)> 

=  g/u  12  exP  ( ~  l’8ft2r2)  B  v(r).  (9) 


In  the  above  equation  ft  is  the  parameter  of  the  translationally  invariant  wave  function 
md  r  is  the  relative  separation  of  the  centre  of  masses  of  the  two  clusters.  The  value  of 

i=2-02  fm-1  (Shanker  et  al  1980a).  The  quark  space  exchange  operator  exchanges  the 
position  coordinates  of  quarks  1  and  6  not  only  in  the  clusters  internal  wave  functions 
but  also  in  the  relative  wave  function  of  their  centre  of  masses.  We  have  neglected  it  in 
(9),  which  is  justified  because  the  cluster  shell  model  wave  functions  are  expected  to  vary 
slowly  with  r  relative  to  the  cluster  internal  wave  function.  The  usual  transformation  is 
used  to  transform  the  nuclear  shell  model  wave  functions  in  (9)  to  the  ls  coupling 
scheme.  This  allows  us  to  express  the  six  quark  spin  isospin  wave  functions 
corresponding  to  a  nucleon  or  a  delta  in  the  following  form 


U n,d%sn,dSMs',  3  >• 

Although  lengthy,  it  is  straightforward  to  evaluate  now  the  spin  isospin  me. 
<*3/2  3/2(1,  2,  3)X3/2  3/2(4,  5,  6)  S' M rr3  |P*(16)Pf(16)| 

X1/2  1/2(1,  2,  3)xi/2  1/2(4,  5,  6)5Ms-7T3  ) 


=  gj- [ (9  ~  4S)(9  -  4T)J- 1 12 S (SMsTT3  ;  S' M  ST T'3 ). 


(10) 


Using  the  results  from  (8)  to  (10)  in  (7),  the  six  quark  me  is  reduced  to  the  usual  shell 
model  type  me  of  a  two-body  interaction  for  the  centre  of  masses  of  the  clusters.  This 
effective  two-body  interaction  is 


v(r,  ST)  = 


"^[(9 -45)(9  —  4 T)yi2v(r). 


(11) 


The  exchange  integral  in  (7)  is  reduced  to 


STL 


(12) 


The  antisymmetric  two-body  me  has  =  o[l  +(  -  l)s+r+lP*(12)],  and  the  geometric 
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transformation  factor  X  is 

X(STL ■  d6':m=I1U’n,j'm'\JM)fUm,j'm'\JM)f(it,  '  |7T3)„ 


JM 


(it,  it'  \TTM2L  +  l)(2S+l)[(2/+l)(2/'  +  l)]‘'2 
[(?/  +  1)(2/  +  l)]j/2 


/ 

1/2  j 

( 

l 

1/2  j 

/' 

1/2  / 

►  < 

l 

1/2  f 

S  J 

P 

L 

V 

S  J 

/ 

(13) 


In  (13)  the  subscript  p  and  <5  indicate  the  NiV  pair  in  shell  model  orbits  p  =  (np  Ipjptnp) 
and  similarly  for  p'  and  a  pair  of  deltas.  The  corresponding  values  of  quantum  numbers 
are  to  be  used  in  (13).  Equation  (12)  gives  the  probability  amplitude  for  finding  the  two 
3-quark  clusters  of  nucleons  in  a  nuclear  shell  model  states  p  and  P'  and  in  the  two  deltas 
in  shell  model  states  3X  and  S2 .  The  probability  P  (S3' :  pp' )  is  a  square  of  this  amplitude. 
The  total  probability  of  finding  a  pair  of  AA  isobars  in  a  nucleus  would  be  the  sum  over 
the  occupied  nucleon  orbitals  pp'  in  a  given  nucleus  and  over  all  the  possible  shell  model 
states  33'  of  the  isobar  pair  (including  their  charge  states).  The  sum  over  33'  is 
performed  using  the  orthogonality  relation  of  the  9  —j  symbols  and  the  completeness 
relation  of  the  harmonic  oscillator  states.  The  sum  is  then  expressed  in  terms  of  the 
probability  of  creating  an  isobar  pair  A  A  at  positions  rx  and  r2  inside  a  nucleus.  As  we 
are  studying  only  closed  shell  nuclei  with  N  =  Z,  the  sum  over  m,  m  and  t ,  t'  (p,  P')  in  PD 
can  be  easily  performed.  The  isospin  state  sums  of  the  AA  pair  further  simplify  this 
expression.  The  reduced  form  of  PD  of  creating  a  AA  pair  at  positions  rj  and  r2  inside 
the  nucleus  becomes 

PD(ri,r2)  =  I  (2T  +  V)(2L  +  1)(2S  +  1)(2J  +  1)(2/  +  1)(2/'  +  1) 


nlj,  ri  If 
LSTJ 


l 
l 


1/2  j 

1/2  r 

S  J 


(tl,r2\va(ST)\nl,  n'l ;  LM)\ 


(14) 


The  sum  over  nlj ,  n'l'f  in  (14)  extends  over  the  occupied- orbitals  in  the  nuclear  shell 
model  wave  functions.  Let  us  transform  the  AA  coordinates  in  the  centre  of  mass  and 
relative  coordinates  R  =  (rj  +  r2)/2  and  r  =  -  r2  and  use  Moshinsky  transformation 

brackets  <  nl,  n'l  ||  L\\  nl,  N  A  >.  After  integrating  PD  over  the  coordinate  R  and  angular 
orientation  of  r,  one  obtains  the  probability  of  finding  an  isobar  pair  AA  in  a  nucleus  at 
a  relative  separation  r  between  them. 

|  (r\va\nl,  n'l;  L  >|2  =  1  [1  +  (- l)s+r+'+1] 

Hj/lN  A 

<  nl,  n'l  ||L  |  nlll ,  NA  yQlh  (r)u2(r,  ST).  (15) 

In  (1 5)  Qnl  (r)  is  the  radial  wave  function  of  the  relative  motion  of  the  AA  pair  inside  the 
nucleus.  From  (14)  and  (1 5)  we  obtain  the  final  expression  for  finding  an  isobar  pair  in  a 
nucleus  at  separation  r 

P»  =  I  (2P+  1)[1  +(-l)s+r+,+1]  CsJv2(r,  ST)Qli(r). 

nlST 


(16) 
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The  nuclear  structure  dependent  constants  C ^  are  defined  as  the  sum  over  occupied 
orbitals  a,  b  in  a  nucleus 


(L  1/2  ja')  2 

CSJ=££( 2A+l)(2J  +  l)(2ja+l)(2jb  +  l)llb  1/2  jb  V 
«• b  u  (IS  JJ 


x  J  <«/,  NL 

NL 


/l 


nala. 


nb  4 


>: 


(17) 


3.  Delta  isobar  components  in  160  and  40Ca  nuclei 

In  these  doubly  closed  shell  nuclei,  the  sum  over  spin  and  isospin  can  easily  be  carried 
out.  The  probability  PD(r)  in  (16)  becomes 

PD(r,  160)  =  e2[82*95(2oo  +  5-93 Q\0  +  29-62g22  +  80-6 Q20l  ]; 

PD(r,  40Ca)  =  d2[274TQoo  +  51-9fif0  +  259-5Q22 

+  22-5Qf2  +  403-OQoi  +  60-45#!  ],  (18) 

where  v(r)  =  27/5(5)1/2  exp  (- 1*8  /52r2);  p  =  2*02  fm-1.  In  (18)  Qn/(r)  are  the  relative 
wave  functions  of  a  pair  of  nucleons  in  the  nucleus.  Before  we  proceed  further,  a 
comment  is  in  order  about  the  use  of  the  term  ‘probability’.  In  this  paper  as  we  are  only 
studying  the  isobar  degrees  of  freedom  and  are  not  calculating  the  nucleon  and  hidden 
colour  components,  we  cannot  normalize  the  total  probability  to  unity.  In  this  sense  the 
use  of  this  term  is  improper.  Therefore,  from  now  on,  we  will  simply  use  the  term  ‘isobar 
degrees  of  freedom’  instead  of  ‘probability’  for  finding  an  isobar  pair  in  a  nucleus. 

We  notice  that  due  to  a  factor  v2(r),  PD(r)  in  (18)  is  maximum  at  short  distances  for 
which  (3-6)1 12 fir  =  1  -9,  /fi*  ~  1  or  r  ~  026  fm.  At  such  short  inter-nucleon  distances  in  a 
nucleus,  the  nucleon  three-quark  clusters  overlap  and  introduce  additional  isobar  and 
hidden  colour  degrees  of  freedom  in  the  nuclear  wave  function.  This  corresponds  to  a 
region  of  momentum  transfer  near  3*5  fm” l.  At  such  short  inter-particle  distances  only 
the  /  =  0  wave  functions  contribute  to  PD{r)  since  Q„i(r)  approaches  zero  for  /  =£  0  and 
for  /  =  0,  Qn0  (n  ^  0)  is  smaller  than  Q00  as  r  -►  0.  Thus  it  is  clear  that  the  isobar 
components  of  nuclear  wave  functions  are  important  only  at  small  inter-nucleon 
separations.  Similar  conclusions  are  arrived  at  for  the  hidden  colour  degrees  of  freedom 
in  a  nucleus.  It  is  expected  that  at  even  shorter  distances  than  this,  the  dominant  part  of 
the  nuclear  wave  function  will  be  the  hidden  colour  component  (Warke  et  al  1981). 
These  are  the  additional  degrees  of  freedom  in  nuclei  besides  those  of  nucleons  which 
are  brought  in  by  the  quark  structure  of  the  nucleon.  Some  experimental  indications 
support  the  observation  of  such  additional  degrees  of  freedom  in  nuclei.  The 
understanding  of  the  discovery  of  Gamow-Teller  resonances  in  (n,  p)  scattering  off 
nuclei  requires  the  presence  of  virtual  deltas  in  a  nucleus  (Goodman  1984).  It  is  expected 
that  there  could  be  a  significant  enhancement  of  the  virtual  pion  field  inside  a  nucleus  in 
the  large  momentum  transfer  processes  ~  2-5  fm”  \  close  to  the  distance  where  PD(r)  is 
maximum.  It  leads  to  the  prediction  of  an  enhancement  of  non-strange  components  of 
the  nuclear  sea,  which  can  be  made  to  agree  with  the  emc  data  at  small  x.  Here  we  are 
not  concerned  with  the  quantitative  prediction  of  our  theory.  Instead,  we  would  like  to 
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qualitatively  study  how  this  AA  component  of  the  nuclear  wave  function  varies  with  the 
nuclear  mass  number  A  for  doubly  closed  shell  nuclei. 

As  discussed  above,  we  neglect  the  non-zero  angular  momentum  contribution  and 
the  n  f  0  contribution  of  the  s-state  to  the  total  integrated  value  of  PD , 

PD(16  O)  =  82-95  [27/5  (5)1  /2  ]  2  /  (3*6  p2b2  +  1)3/2; 

PD(40Ca)  -  274*  1  [27/ 5(5)1/2]2/(3*6/?262  +  1)3/2.  (19) 

We  take  nuclear  shell  model  value  of  b  =  1-01  A1 16  fm.  The  study  of  the  coefficients  of 
Goo  in  (18)  for  different  nuclei  reveals  that  they  are  approximately  proportional  to  A4/3 
with  the  proportionality  constant  2-05.  Using  this  observation,  one  can  express  PD  in 
(19)  as  a  function  of  A  as  follows, 

Pd(A)  ~  0*2 ^4 5/6.  (20) 

The  total  isobar  degrees  of  freedom  in  a  nucleus  arising  from  the  nucleon  quark 
structure  increase  with  the  mass  number  A  as  the  5/6th  power  of  A.  The  additional 
isobar  degrees  of  freedom  per  nucleon  in  a  nucleus  would  be  0-2  T  “ 1/6.  If  we  associate 
these  additional  degrees  of  freedom  to  the  slight  increase  in  size  of  a  nucleon  in  the 
nucleus,  our  result  would  predict  that  this  increase  in  nucleon  size  inside  a  nucleus 
would  become  smaller  for  heavier  nuclei. 


4.  Conclusions 

The  quark  cluster  model  of  a  nucleus  is  developed  and  is  used  to  study  the  isobaric  AA 
degrees  of  freedom  in  nuclei.  The  internal  quark  cluster  wave  function  is  taken  from  the 
baryon  spectroscopic  data  analysis.  These  additional  degrees  of  freedom  are  found  to 
be  important  at  short  inter-nucleon  separation  phenomenon  in  a  nucleus  correspond¬ 
ing  to  the  region  of  momentum  transfer  near  3-5  fm-1.  They  are  brought  in  by  the 
quark  structure  of  nucleons  and  are  found  to  increase  with  the  nuclear  mass  number 
A516.  If  one  associates  with  this  effect  an  equivalent  increase  in  size  of  a  nucleon  in  the 
nuclear  medium,  our  result  would  predict  this  increase  of  nucleon  size  to  depend  on  the 
mass  number.  It  decreases  in  an  exponential  form  T~1/6  as  one  approaches  heavier 
nuclei.  Contrary  to  this  result,  most  predictions  are  linear  in  the  effective  density  of  the 
nucleus  (Arnold  et  al  1984;  Thomas  1984). 
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The  fifth  interaction:  universal  long  range  force  between  spins 
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Abstract.  In  this  paper  we  present  a  review  of  our  investigations  on  universal  long  range 
force  between  spins  mediated  by  a  massless  axial  vector  gauge  field  which  we  name  as  “axial 
photon”.  The  invariance  of  the  Lagrangian  field  theory  of  particles,  possessing  spin  degrees  of 
freedom,  under  local  Lorentz  transformations,  necessitates  the  introduction  of  such  an  axial 
vector  gauge  field  which  interacts  with  spin  current  of  the  particles.  Classical  as  well  as 
quantum  dynamics  of  electrons  interacting  with  photon  and  axial  photon  are  worked  out.  The 
new  interaction  is  found  to  be  asymptotically  free.  It  is  shown  that  QEDcan  be  made  finite  if  the 
coupling  strengths  of  electron  to  photon  and  axial  photon  can  be  made  equal.  Experimental 
consequences  of  the  existence  of  axial  photon  are  discussed  and  the  strength  of  the  interaction 
is  estimated  by  comparing  predictions  of  the  theory  with  experiments. 

Keywords.  Fifth  interaction;  axial  photon;  local  Lorentz  group;  divergences;  stability  of  the 
electron;  asymptotic  freedom;  hyperfine  anomaly;  phase  of  the  wavefunction. 

FACS  No.  12  90 


1.  Introduction 

There  are  at  present  four  kinds  of  known  fundamental  interactions  in  nature  namely: 
strong,  electromagnetic,  weak  and  gravitational.  It  is  the  present-day  conviction  that  all 
these  interactions  stem  from  certain  symmetries  and  are  mediated  by  gauge  fields  that 
become  necessary  to  restore  symmetry  when  the  parameters  of  global  transformation 
are  made  space-time  dependent.  For  instance  the  electromagnetic  field  is  required  to 
restore  phase  invariance  of  the  Lagrangian  when  the  parameter  of  the  phase 
transformation  is  made  space-time  dependent.  The  coloured  gluons  (Fritzsch  et  al 
1973),  the  IF  and  Z  bosons  (Glashow  1980;  Weinberg  1980;  Salam  1980)  that  mediate 
strong  and  weak  interactions  are  gauge  particles  associated  with  space-time  dependent 
SU(3)C  and  SU  (2)®U(1)  transformations  respectively.  The  gravitational  interaction  is 
thought  to  be  mediated  by  spin-2  massless  gauge  particle  required  in  space-time 
dependent  translation  group,  (Cho  1976;  Hayward  1979).  It  is  natural  to  speculate 
about  gauge  particle  associated  with  the  Lorentz  group.  Although  the  latter  has  been 
considered  by  various  authors  (Utiyama  1956;  Kibble  1961;  Sciama  1962)  who  obtain 
the  gravitational  interaction  by  gauging  the  Lorentz  group  it  is  somewhat  artificial.  In 
all  gauge  theories  the  gauge  particle  usually  couples  to  the  conserved  current  of  the 
global  group  which  is  gauged.  In  Einstein’s  theory  the  gravitational  field  couples  to 
energy-momentum  tensor  (Carmeli  1982)  which  is  the  conserved  current  for  the  space- 
time  translation  group.  It  is  therefore  natural  to  associate  gravitational  interaction  with 
the  local  translation  group.  In  the  Lorentz  group  the  conserved  current  is  the  angular 
momentum  tensor  density  and  one  would  therefore  expect  that  the  gauge  particle  in  the 
local  Lorentz  group  should  couple  to  this  density.  In  other  words,  a  local  Lorentz  group 
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should  lead  to  interaction  between  spins  mediated  by  an  axial  vector  gauge  boson.  We 
have  shown  that  this  is  indeed  the  case  (Naik  and  Pradhan  1981).  Our  approach  to 
gauging  the  Lorentz  group  is  different  from  those  of  Kibble,  Sciama  and  Utiyama.  We 
name  the  gauge  particle  as  “axial  photon”.  When  the  gauge  theory  is  worked  out  in 
detail  one  finds  that  axial  photon  couples  to  the  axial  vector  currents  of  the  Dirac 
particle  as  well  as  that  of  the  photon  and  the  space  integrals  of  the  space  components  of 
these  currents  are  the  spin  vectors  of  the  respective  particles.  We  thus  have  a  fifth 
interaction  which  operates  universally  between  spins  of  particles.  Further  we  have  a 
mechanism  of  dynamically  measuring  spin  through  axial  photon  probe. 

That  such  an  interaction  should  exist,  can  be  inferred  from  physical  considerations 
without  going  to  gauge  theory.  This  had  been  done  when  axial  photon  was  first 
introduced  (Pradhan  and  Lahiri  1974).  Their  motivation  for  introducing  the  axial 
photon  was  to  give  stability  to  the  classical  electron  and  also  to  make  quantum 
electrodynamics  divergence  free.  It  was  argued  that  if  an  axial  vector  boson  is  coupled 
to  the  axial  vector  current  of  the  electron  it  would  result  in  spin-spin  interaction  which 
is  attractive  for  parallel  spins.  The  parts  of  classical  spinning  electron  would  therefore 
attract  each  other  and  compensate  the  force  of  Coulomb  repulsion.  It  turns  out,  as  we 
shall  see  in  §  3  that  this  is  indeed  the  case  but  only  when  the  spin-spin  coupling  strength 
is  equal  to  the  strength  of  the  Coulomb  interaction.  It  also  turns  out  that  in  quantum 
theory  the  divergence  associated  with  electron-self-energy  due  to  exchange  of  virtual 
photon  cancels  with  that  arising  from  exchange  of  virtual  axial  photon  under  exactly 
the  same  condition  (Pradhan  and  Lahiri  1974).  However,  the  axial  photon  coupling  to 
electron  and  photons  with  such  large  strength  would  destroy  the  excellent  agreement  of 
present  quantum  electrodynamics  with  experiment  unless  of  course  the  axial  photon 
acquires  a  large  mass  through  spontaneous  breaking  of  the  gauge  symmetry.  This 
possibility  was  considered  by  Pradhan  and  Lahiri.  The  large  mass  suppresses  physical 
processes  involving  axial  photon  exchanges,  but  do  not  affect  divergent  diagrams. 

As  mentioned  earlier,  the  force  arising  out  of  exchange  of  axial  photon  between 
particles  with  spin  is  attractive  when  the  spins  are  parallel  and  repulsive  when  they  are 
antiparallel.  Strikingly  enough,  this  is  exactly  the  situation  with  the  experiments  carried 
out  with  polarized  laser  beams  by  Tam  and  Happer  (1977)  who  found  that  two  right 
circularly  polarized  laser  beams  passing  through  dense  sodium  vapour  attract  each 
other  while  those  with  opposite  circular  polarization  repel.  They  also  found  that  the 
force  between  the  beams  was  of  long  range.  The  sodium  vapour  acts  as  a  spin 
polarizable  medium  which  on  account  of  asymptotic  freedom  of  axial  photon  coupling 
enhances  the  spin-spin  force  between  laser  beams.  By  analysing  the  experimental  results 
of  Tam  and  Happer  and  considering  the  enhancement  by  spin-polarizability  of  the 
medium,  Naik  and  Pradhan  found  that  good  agreement  between  theory  and  the 
experimental  results,  over  the  range  of  densities  of  sodium  vapour  used  in  these 
experiments,  can  be  achieved  if  the  coupling  strength  is  taken  to  be  (xg  =  g2/4n  =  0-7 
x  1CT9. 

An  independent  estimate  of  the  coupling  strength  can  be  made  from  the  contribution 
of  the  long  range  force  between  nuclear  and  electronic  spin  to  hyperfine  splitting  in  the 
hydrogen  and  the  deuterium  atoms  for  which  only  a  small  discrepancy  exists  between 
theory  and  experiment.  This  discrepancy  can  be  accounted  for  by  the  long  range  spin- 
spin  interaction  if  the  axial  photon  coupling  strength  is  taken  to  be  ag  =  g2/4n  ~  10“ 13 
which  is  several  orders  of  magnitude  smaller  than  that  estimated  from  the  force 
between  circularly  polarized  laser  beams  in  sodium  vapour.  We  take  the  latter  value  to 


The  fifth  interaction 


79 


be  more  reliable  since  precision  of  measurement  of  hyperfine  splitting  is  much  higher 
than  that  of  the  laser  beam  experiment. 

In  view  of  these  two  experimental  evidences  for  the  existence  of  the  fifth  interaction 
we  think  it  worthwhile  to  review  the  classical  and  quantum  aspects  of  this  interaction. 
In  this  paper  we  begin  with  a  presentation  of  the  gauging  of  the  Lorentz  group  in  §2. 
The  classical  dynamics  of  the  theory  is  worked  out  in  §  3.  Equations  of  motion  for  the 
axial  photon  field  and  its  spin  source  are  worked  out  and  their  static  solution  for  point 
sources  obtained.  Stability  of  the  classical  electron  and  radiation  of  axial  photon  by 
spin  sources  are  discussed  in  the  classical  framework.  Section  4  is  devoted  to  the 
quantum  dynamics  of  the  theory  where  questions  such  as  the  removal  of  divergences, 
asymptotic  freedom  of  the  interactions,  its  contribution  to  hyperfine  splitting  and  two- 
body  bound  states  resulting  from  long  range  spin-spin  forces  are  discussed.  In  this 
section  we  also  discuss  a  Bohm-Aharonov  type  of  experiment  which  can  establish  the 
existence  of  the  fifth  interaction  as  well  as  measure  its  strength. 


2.  Local  Lorentz  group 


Invariance  of  Lagrangian  density  under  global  Lorentz  group  of  transformations: 


*0-**i  =  */*  +  aA*v*v 

► 


a 


n\ 


J 


leads  to  conservation  of  angular  momentum  JMV: 

dJ „  “  ^ 


.  —  o-  J 

dt  5 


=  0 


where  vA  is  the  angular  momentum  density.  As  mentioned  in  §  1  one  expects  that  in 
the  local  Lorentz  invariant  theory  the  gauge-field  would  couple  to  the  conserved 
current  Mm  vA  .  This  implies  that  the  gauge  field  would  be  a  third  rank  tensor  .  That 

this  is  actually  the  case  can  be  seen  as  follows: 


The  above  Dirac  Lagrangian  density  which 
under  which, 

f *  t- < 

^  (x)  ->  p  (x)  =  exp  -2.vAavA 


Zva  =  7  (w*  -  Vat\) 


is  invariant  under  global  transformation  (1) 
\ j/(x)  =  Qi//(x) 

(4) 


ceases  to  be  so  when  the  parameters  a^v  are  made  space-time  dependent.  However,  the 
Lagrangian 

&D  =  h  -  (£>>)?„'/']  -  '#'/'> 

where  =  8$  +  igB 


(5) 

(6) 
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remains  invariant  when  the  massless  gauge  fields  B M  transform  as: 


B„(x)-  B'^x')  =  Sl(x)Bli(x)Cl-1(x)-Udlitl(x))n-1(x).  (7) 

At  this  stage  it  is  necessary  to  point  out  that  in  the  above,  we  have  considered  the 
invariance  of  the  Lagrangian  density  which  for  global  transformation  ensures  the 


invariance  of  the  action  S  = 


» 

d4Xo^(x).  For  local  transformation  this  is  not  the  case; 


one  needs  extra  conditions  because. 


S 


d4x.S?(x)  ->  S'  = 


d4x'i?'(x') 


J 


d4x  JJzf”(x'), 


where  J  is  the  Jacobian  for  the  transformation  of  the  four-volume  element.  Since  J  =  1 
+  d p  ( Sx M)  one  requires  both 

<£\x!)  =  &(x) 

y  (9) 

d/SxJ  =  0.  V  ' 

The  latter  is  automatically  satisfied  for  global  transformation.  For  our  local  Lorentz 
group  this  is  equivalent  to, 

=  °-  <10) 


In  the  Kibble-Sciama  (Kibble  1961;  Sciama  1962)  approach  to  local  Lorentz  group 
this  restriction  is  not  put,  they  use  vierbeines  to  define  covariant  derivatives  and  their 
theory  leads  to  Einstein-Cartan  theory  of  gravitation. 

The  tensor  gauge  field  Bm  vA  is  antisymmetric  in  the  last  two  indices.  We  can 
decompose  it  into  symmetric  and  anti-symmetric  parts  in  the  first  and  last  two  indices 
as: 


=  2^ 


(11) 


where  a  is  an  axial  vector  field  and, 

=  2  (^,vA  +  Bv,nX  + 


is  a  symmetric  tensor  field  which  on  account  of  (10)  transform  as, 
a^x)  -  a'^x’)  =  a^x)  +  2/3 aflv(x)av (x)  - A 


where  =  e^d,  A 

and  S£>  A  -  S'™  =  SI  (x)  *  SI  ~ 1  (x). 


We  shall  name  the  axial  vector  field  as  “axial  photon”  field.  Substituting  (6)  and  (11)  into 
(5)  we  obtain 


O?  _  <^(  0)  I  (£>{  I) 

JL  i)  —  p  i  =2-  [) 

where  &*§  =  gM^B^ 

Or  =  ^-  e„v;ic ' J  (r„Zvi  +  a c 


(13) 
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which  is  equivalent  to  taking 

=  djl*  -2/2gy5'l'all. 


(14) 


It  will  be  noticed  that  the  tensor  field  S  A  does  not  couple  to  the  Dirac  field;  only  the 
axial  vector  field  a M  couples  to  the  axial  vector  current  of  the  Dirac  field.  The 

space  integral  of  the  space  components  of  this  current  is  the  spin  of  the  Dirac  particle 
and  that  of  the  time  component  its  helicity.  It  thus  follows  that  the  axial  vector  field 
couples  to  the  spin  and  helicity  of  the  Dirac  particle  just  as  the  Maxwell  field  couples  to 
its  electric  charge  and  current.  Thus  axial  photon  provides  a  means  of  dynamical 
measurement  of  spin  in  the  same  sense  as  photon  measures  the  charge  of  the  electron. 

The  kinetic  energy  term  of  the  gauge  field  must  now  be  included  in  the  Lagrangian.  It 
is  given  by 

(15) 


se< 


=  ~\B^B^ 


where 


B„v  =  ^Sv-avBp  +  fe[B„  B.] 
B ,  = 


J 


(16) 


The  axial  photon  part  of  is: 


3  g- 


(17) 


Since  the  photon  has  spin  it  will  also  couple  to  axial  photon.  To  obtain  the  interaction 
Lagrangian  we  start  with 


se 


M 


where 

with 


D,AV 


-i  (D„Av-DvAJ(D^Av-DvA,l 

^uAv  +  ig Xrj  Ar, 

~~  j(^va^»i  —  ^vA;.)* 


(18) 

(19) 

(20) 


Substituting  (11)  and  (20)  in  (19)  and  evaluating  (18)  gives  the  following  interaction 
Lagrangian  between  photon  and  axial  photon 

&mg  =  ~9e„yxi(S„Av)Aiai-^92[a2A2-(aA)2].  (21) 


It  will  be  noticed  that  this  Lagrangian  is  not  invariant  under  the  electromagnetic  gauge 
transformation 


A' 


(22) 


This  can  be  rectified  by  replacing  Au  occurring  in  the  Lagrangian  by  (A^  +  XdM$>)  where 
<I>(;t)  is  a  massless  scalar  field  which  transforms  as 


<t>(x)  ->  </>'(*)  = 


Xe 


a(x). 


(23) 


In  order  to  ensure  the  invariance  of  the  kinetic  energy  term  (dfl>)  of  the  scalar 

field  we  add  a  counter  term  —  to  the  Lagrangian.  In  order  to  decouple  the 

scalar  field  from  the  rest,  one  goes  to  the  limit  X  -*  0.  This  has  been  discussed  in  detail 
(Naik  and  Pradhan  1981)  and  is  a  slightly  modified  version  of  Stueckelberg’s  (1938) 
compensating  field  method. 


P.-  6 
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3.  Classical  dynamics  of  axial  vector  gauge  field 


3.1  Equations  of  motion  for  the  field  and  its  source 

The  Lagrangian  density  for  the  field  interacting  with  its  spin  source  as  worked  out  in  the 
preceding  section  can  be  written  as 

jS?  =  +  if(1)  =  i?(0,  +  £e{1\  (24a) 


where 


^(0)  =  -  i(<yo2  -  F„ + ic*r  A*  -  (^)7>]  -  "W, 

&{i)=  -  ig2  [a2  A2  -  (a- +  sjx^x), 


(24b) 

(24c) 


where 


«„(*)  =  3/2#y5y„i/'  -  gs^^A^,  (24d) 

is  the  spin  current  density  of  the  electron  and  the  photon.  If  we  add  a  four  divergence 
jd^l(avdvafl)  to  the  if (0)  in  (24b),  the  kinetic  energy  of  the  axial  photon  can  be  written  as 

JZXaxia  1)=  -I/^v,  (25a) 

where  /„„  =  d„av  -  3va„,  (25b) 

and  the  equations  of  motion  that  follow  from  the  Lagrangian  are, 

5,  =  s„(x) 

f  (26) 


5v/at  ~  0 


7 


if  we  neglect  nonlinear  terms  of  order#2  in  (24c).  In  terms  of  S{  —  foi  and^  =  j£ijkfjk 
these  equations  read 


V  xS— 


1  d@ 


c  dt 

IdS 


0,  \-<%  =  0 


Vx  ^  — —  =  s(x)  V  S  =  h(x) 
c  dt 


(27) 


where  h(x)  —  s0(x)  is  the  helicity  density  and  together  with  spin  density  s(x)  obeys  the 
continuity  equation: 


=  0. 


(28) 


The  equation  for  the  axial  vector  potential  a M  that  follows  from  (24b)  when  terms  of 
order  #2are  neglected,  are: 


□  %  =  sM(x). 

This  equation  can  also  be  regarded  as  Hamilton’s  equations: 


(29) 


where 


diii 

dt 


=  d; 


SH 


JS(djai) 

SH 

da{ 


SH 

Saf 


(30) 


(31) 
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is  the  field  canonically  conjugate  to  at  and 

+  +  (32) 

H0  being  the  energy  of  the  bare  source.  It  will  be  seen  that  the  axial  vector  field  couples 
to  the  spin  of  the  source. 

The  equation  of  motion  for  the  spin  of  the  source  can  be  obtained  by  using  the 
Poisson  Bracket  equation: 


dSJdt  =  i  [H,  SJ. 
Using  the  definition, 


S = 


d3  x  s,(x), 


and  the  ansatz 

Oi(*)>  =  ieijkS(x  -x')sk(x), 

yields 


dSJdt  =  -  eijk 


d3  x  Sj(xt)ak(xt). 


(33) 

(34) 


(35) 


This  equation  shows  that  the  axial  photon  field  exerts  a  torque  on  the  spin  the  direction 
of  which  is  perpendicular  to  the  field  potential  a  as  well  as  to  the  spin  of  the  source  so 
that  S2  =  constant.  If  a  were  space-time  independent,  the  solution  of  (35)  would  give  a 
gyration  of  the  spin  like  that  of  a  spinning  top  in  a  gravitational  field.  This  situation  is  to 
be  contrasted  with  the  interaction  of  the  photon  field  on  a  charged  particle  which  gets 
accelerated  by  it. 


3.2  Static  field  of  a  spin-source 
Solution  of  (29)  can  be  written  as 

1 


af(x)  = 


4n 


dv  s-(x,) 


X  —  X 


(36) 


For  the  spin  distribution  we  can  take 

Si(x)  =  0P(x)Sj,  (37) 

where  p(x)  is  the  spin  distribution  function.  Substitution  of  (37)  in  (36)  leads  to 


a,(x) 


4n 


d3x 


,  0P(x') 


X  — X 


(38) 


For  a  point  spin  source  p(x')  =  3/2 6(x') 

i  t  3gS‘ 

fl((x)  = 


8w|x|  ’ 


and 


&  =  —  V  x  a  = 


3g  (x  x  s 

8?r  V  “P5- 


(39) 


(40) 


3t  =  —  da/dt  =  0. 


(41) 
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The  situation  here  is  to  be  compared  with  static  photon  field  of  a  steady  electric  current 
which  produces  a  steady  magnetic  field  but  no  electric  field.  It  will  be  seen  from  (40)  that 
for  a  right  circularly  polarized  light  beam  travelling  in  a  straight  line  for  which  the  spin 
is  in  the  direction  of  motion,  the  £  lines  of  force  will  be  circles  around  the  beam  axis 
just  like  magnetic  lines  of  force  (B)  around  a  straight  wire  carrying  current.  On  the 
other  hand  for  circularly  polarized  light  beam  travelling  in  a  circular  path,  the  £  lines 
of  force  would  be  on  the  surface  of  a  torous  around  the  path  like  those  of  B  lines 
around  a  circular  current.  It  then  follows  that  the  £  lines  for  an  infinitely  long  helical 
light  path  as  in  a  selfoc  fibre  would  be  exactly  like  the  B  lines  produced  by  a  solenoid, 
i.e.  the  £  lines  would  be  parallel  to  the  fibre  and  confined  to  the  inside  of  the  fibre. 


3.3  Stability  of  classical  electron 

It  is  well  known  that  classical  finite  size  electron  is  not  stable;  its  parts  would  fly  apart 
due  to  repulsive  Coulomb  interaction  which  cannot  be  prevented  unless  forces  of  non- 
electromagnetic  origin  are  invoked.  It  will  readily  be  seen  that  such  forces  would  arise 
from  the  new  interaction.  The  static  fields  produced  by  electrons’  spin  and  electric 
charge  would  be 


er 


E  =  —3,  B  =  0;  £  = 
4  nr 


3  g  r  x  s 
8ti  r 


3  » 


<2  =  0. 


(42) 


The  E  and  £  lines  of  force  are  shown  in  figures  I  a,  b.  These  fields  would  give  rise  to  a 
self-force: 


3  g  e2r  9g2S2r 

F=,E-Tsx<f=5--3 

which  can  be  made  to  vanish  by  suitably  choosing  "g" 
0/2g)2  =  e2jS2  or  ^g2  =  ~. 


(43) 


(44) 


This  would  ensure  stability  of  the  electron.  For  an  extended  electron  calculations  will  be 
somewhat  more  complex.  We  thus  see  that  the  fifth  interaction  can  provide  a  “Poincare 
stress”  for  ensuring  stability  of  the  classical  electron. 


3.4  Static  interaction  between  two  spins 

The  potential  energy  between  two  particles  possessing  spins,  one  located  at  the  origin 
and  another  at  r  is  given  by 

vAb( r)=  -  JdVs|B)(r  -  r')a^)(r').  (45) 

Using  expression  (39)  for  a{( r)  in  (45)  we  obtain  after  some  manipulations, 

vab  M  =  [S/4‘Sb],  (46) 

Ztz  j  r  | 

which  shows  that  the  force  between  two  spins  is  attractive  when  the  two  are  parallel 
while  it  is  repulsive  when  they  are  anti-parallel.  This  can  be  considered  as  the  Coulomb 
law  for  the  fifth  interaction.  It  is  important  to  note  that  identical  (parallel)  spins  attract 
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E  Lines 


Figure  1.  (a)  Electric  lines  of  force  due  to  change  of  the  electron,  (b)  Axial-electric  lines  of 
force  due  to  spin  of  the  electron. 


while  identical  charges  repel.  Therefore  for  an  extended  particle  with  charge  and  spin, 
the  instability  caused  by  repulsion  of  different  parts  of  the  charge  distribution  would  be 
opposed  by  the  attraction  between  parts  of  spin-distribution.  We  have  already  seen  this 
from  a  slightly  different  consideration  in  §3.3. 

3.5  Radiation  of  axial  photon 

Solution  to  the  wave  equation: 

,  1  da 

V2  a  — j —  =  4ngs  (r)  (47) 

c  at 


can  be  written  as 


a(rf)  = 


/*  i* 


d  V 


dr' 


For  the  sinusoidal  time  variation  of  the  spin  density 


(48) 


s(r,  r)  =  s(r)exp(-icof) 
equation  (48)  becomes, 


fj3  ,  ,  „exp[ik(r-r')] 
dVs(r ) - - - - - 

r  —  r 


a(r)  = 


(49) 
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With  k  —  co/c  and  for  the  limit  kr  >  1  this  reduces  to 


_  exp(ikr) 
lim  a  (r,  0)  = - — - 

kr-*  oo  f 


d  V  s  (r')  exp  ( —  ikn  •  r' ), 


(50) 


where  n  is  a  unit  vector  along  r.  For  the  case  where  the  dimensions  of  the  source  are 
small  compared  to  wavelength  we  can  expand  the  exponential  in  (50)  in  powers  of  k  and 
the  first  term  in  the  expansion  namely 


a(r)  =  eXP,(ik  r)  ^3 


dVS(r»  =  CXpl,krlS 


(51) 


gives  the  dominant  contribution  to  the  radiation  of  axial  photon.  Since  eS/mc  =  fiis  the 
Dirac  magnetic  dipole  moment,  this  radiation  can  be  termed  as  the  magnetic  dipole 
radiation  of  axial  photon.  The  fields  £  and  &  outside  the  source  derived  from  (51) 
work  out  to  be 


and 


lf3  mc2g\ /  exp(ikr)/  1 
£  —  —  V  x  a  =  i/c(  — - — —  |(n  x  n) - — —  -(  1 


2e 


ikr 


£  =  —  — V  x  e 
k 


a  =  - 


3  gmci 
2e 


(n  x  (i)  x  n/c2  +  [3n(n  )k2  -  ft] 


1  ik\\  exp(ikr) 
j  ikr 


xi  -y - 

r  r 


In  the  radiation  zone,  they  take  the  limiting  values: 


(52a) 


(52b) 


£= 


0t  = 


^,^j(nx^)xJexp('k'r) 

—  \  (n  x  n)  x  n  ^  exp(lk  r) 


(53a) 


(53b) 


Thus  the  time-averaged  power  radiated  works  out  to  be 


c  „  _  ,  _ .  ck2  ( 3mc2g 


=  -^[r2n-(£X  <£*)]  = 


p|2sin2  6, 


dD  87r 


871  V  2e 


(54) 


where  6  is  the  angle  between  the  direction  of  radiation  with  respect  to  p.  The  total 
power  radiated  is: 

ck2  / 3 mc2g 


&  = 


nl2- 


3  V  2c 

The  formula  for  the  counter  part  in  magnetic  dipole  radiation  of  photons  reads, 

ck 4 


(55) 


&  = 


(56) 
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4.  Quantum  dynamics  of  axial  vector  gauge  field 


4.1  Two-body  force  and  asymptotic  freedom 

The  force  between  two  particles  with  spins  and  S#  is  mediated  by  axial  photon 
exchanged  between  them.  The  two-body  potential  has  been  worked  out  (Naik  and 
Pradhan  1981)  using  Coulomb  gauge  axial  vector  propagator.  It  is  more  convenient  to 
use  the  Feynman  gauge  propagator 

D^  =  gJ(.q2-w2lc2).  (57) 

The  two-body  potential  would  then  have  the  form 

(Bq)2 

vAB=  -^-(SrS  B)  (58) 

&nr 

where 


3,  A  and  B  fermions, 

73,  A  is  boson,  B  fermion. 


(59) 


For  p  =  3  it  is  identical  to  the  classical  expression  (46).  The  two-body  force  being 
attractive  for  parallel  spins  has  far-reaching  consequences.  A  test  body  with  spin,  in  a 
spin-polarizable  medium  would  be  antiscreened  by  the  medium  rather  than  screened 
like  a  test  charge  in  a  charge-polarizable  medium.  This  is  because  the  test  spin  would 
attract  other  spins  parallel  to  it  and  repel  those  antiparallel  to  it.  Thus  the  effective  spin- 
spin  force  in  a  spin-polarizable  medium  would  be  stronger  than  in  vacuum.  In  other 
words,  the  force  will  decrease  as  one  gets  closer  to  the  test-body.  Thus,  the  effective — 
axial — photon  coupling  constant  would  decrease  as  momentum  transfer  is  increased. 
Such  forces  are  called  asymptotically  free  forces.  This  conclusion  will  be  confirmed 
from  a  calculation  of  the  Callan-Symanzik  ^-function  in  a  later  section. 


4.2  Cancellation  of  divergence  in  QED 


We  shall  first  consider  the  problem  in  the  non-relativistic  limit.  The  ground  state  self¬ 
energy  of  an  electron  in  the  second  order  of  perturbation  theory  is  given  by  (Bethe  and 
Salpeter  1957) 


AF0 


<  0 1 /// 1 m  >  <n\Hj\oy 

En-E0  +  a> 


(60) 


^  p  ■  A  3 

where  H ,  = - h-gaa. 

m  2 


(61) 


Inserting  (61)  in  (60)  gives 


A  E  = 


167T 


I 


(*  oo 


d  coco 


nv 


e  <  0  j  p.  t  n  >  <  n  I  Pi  1 0  >  +(^l  <0|ffi|n>  <  m|ct,-|0  > 


n  J 


0 


£„  -  £0  +  w 


(62) 
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which  can  be  written  as 


1 


00 


A  E  =  X  dco  CO 

071 


—  e 


m4 


{E0-En)  <0|Pi|n>  <  n\Pi\0  > 


c o(co  +  En-E0 ) 
e2/m2  <  0|pt-|w  >  <«|Pi|0> 


n  JO 
2  /^2 


CO 


+ 


(ig/2): 

6n2 


I 


m 


r  oo 


dco  <  Olajm  )  (mlcrJO), 


(63) 


o 


where  m  stands  for  the  spin  sublevels  of  the  ground  state.  It  will  be  noticed  that  the  first 
term  in  the  above  expression  is  convergent  while  the  last  two  terms  are  linearly 

divergent  and  would  cancel  with  each  other  if  =  e2- 

We  shall  next  consider  the  relativistic  case.  The  free  electron  self-energy  contributed 
by  the  photon  and  axial  photon  shown  in  figures  2a,  b, 

>2  r 


X»  = 


le 


(2  n)‘ 


,4 if(p—k)—m 


( p-k)2  +  m: 


7,Du,(k) 


(2»r 


(P  ~  k)2  +  nr 


(64) 


from  which  after  a  certain  amount  of  calculation  one  obtains  the  self-mass. 


Sm 


oipm  ( 3D  9\  <x*m  ( 3D 


271 


2  +  4  )  (2k)  I 


-5/4 


w'here 


OL  =  -  CL *  = 

e  An  9 


(3/2g)2  9 

4  n  4  9 


(65) 


It  will  be  noticed  that  the  divergent  parts  in  Sm  cancel  if 
e2  =(3>g/2)2. 


4.3  Callan-Symanzik  beta  function 

The  asymptotic  freedom  of  the  axial  photon  interaction  can  be  confirmed  by 
calculating  the  Callan-Symanzik  ^-function  (Callan  1970;  Symanzik  1970).  For  the 
electron  axial  photon  interaction 


m  = 


-3  g  8 


2  8  In  A 


(In  2V2), 


(66) 


k 


P  (p-k) 


P 


Co) 


CtO 


Figure  2.  (a)  Photon  contribution  and  (b)  axial-photon  contribution  to  the  self-energy  of 

the  electron. 
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where  23  is  the  renormalization  constant  associated  with  axial  photon  self-energy. 
Explicit  calculations  of  the  relevant  diagrams  (Naik  1980)  gives 


~  2a„  A2 

23  =  1  +  ,  9  In  , 

3n  m 2 

(67) 

from  which  we  get 

m  =  )-  +  0(gs). 

(68) 

The  negative  sign  confirms  the  asymptotic  freedom  of  the  electron-axial-photon 
interaction. 


4.4  Contribution  of  axial-photon  interaction  to  hyperfine  anomaly 

The  ratio  of  frequencies  of  hyperfine  splitting  in  deuterium  and  hydrogen  atoms  is 
given  by  (Bethe  and  Salpeter  1957) 


Vd 


3  / Mq  V  2gD 

4  gH 


(1  +  A), 


(69) 


where,#D  andJ(H  are  reduced  masses  and  gD  and  gp  are  Lande  g-factors  for  deuterium 
and  hydrogen  respectively.  A  represents  nuclear  structure  and  relativistic  recoil-effects, 
which  have  been  calculated  from  theory.  It  is  found  that  there  exists  a  very  minute 
discrepancy  between  the  experimental  and  theoretical  values  of  A; 


theory  =  28  X  1(T5, 

Aexp,  =  17  X  10"  5. 


(70) 


The  hyperfine  splitting  discussed  above  is  taken  to  be  due  to  interaction  between  the 
electron’s  spin  magnetic  moment  and  the  nuclear  magnetic  moment.  It  is  a  short  range 
interaction  between  their  spins.  The  discrepancy  could  be  ascribed  to  the  existence  of 
long  range  interaction  between  spins  of  the  electron  and  the  nucleus  caused  by  axial- 
photon  exchange.  We  therefore  take  this  extra  interaction  into  account  in  the 
calculation  of  hyperfine  splitting. 

Taking  the  nuclear  spin  as  I  and  the  electron  spin  magnetic  moment  as  p,  the 
standard  hyperfine  interaction  Hamiltonian  can  be  written  as  (Bethe  and  Salpeter  1957) 


=  -  2^o 


3^(In)<5(r)  +  i-(In 


3(Ir)  (fi-r) 


(71) 


where  k  is  the  orbital  angular  momentum  of  the  electron  and  pQ  —  e/2 m,  a  Bohr- 
magneton.  The  expectation  value  of  J^c  gives  the  lowest  order  contribution  to  the 
hyperfine  splitting. 

Now  to^c,  we  add  the  long  range  spin-spin  part  of  the  interaction  so  that  the  total 
interaction  is  given  by 


***  =  ^ 


(fa)2 

87cr 


SI, 


(72) 


where  S  is  the  electron  spin;  I  is  nuclear  spin. 

The  expectation  value  of  the  additional  term  is  easily  seen  to  be 
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m2  / 1 

87c  \  r 
For  hydrogen-like  atoms 
l\  2  z 


F2  —S: 


;  F  =  I +  S. 


(73) 


r  /  cten 


^v;  = 


2  >’ 


e  4n 

Thus  the  final  result  of  the  expectation  value  is 


(74) 


P 


2,  ag\  z  /F(F+l)-/(/+l)-S(S+l) 


oce  >n‘ 


Ry  ;  ccg 


9' 


4n 


(75) 


For  hydrogen  atom, 

S  =  1/2,  I  =  1/2;  F  =  1,  0;  p  =  3 


A £#>  =  -91  ^  )RV. 


a. 


cc< 


(76) 


For  deuterium  atom, 


S  =  1/2;  I  =  1;  F  =  1/2,  3/2;  0  =  ^3 


A  £'d9)  =  -  9/2 


a. 


a. 


(77) 


These  have  to  be  added  to  the  existing  Fermi  formula  (Bethe  and  Salpeter  1957,  p.  196) 


leading  to 


A  £#otal)  =  A  E'hc)  +  AEffl, 
A£k,otal)  =  A£^  +  A£g». 


(78) 


(79) 


(80) 


Taking  effects  of  nuclear  motion  and  structure  one  gets  for  the  ratio  of  splitting  in 
deuterium  and  hydrogen 


vd 
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3  (dto  V 

4  v^h/ 
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9d9d 
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9p9n  \  9x3 
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or 


where, 


-  =  — d+A-A')> 

vh  4  \Mh)  Qp 


9  ct 

A'  =  -  - 
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1 


9d9d  9p9p 


(82) 
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It  will  be  seen  that  the  theoretical  value  of  the  ratio  is  reduced  as  a  result  of  taking  long 
range  spin-spin  force  into  account.  This  is  what  is  required  to  obtain  agreement  with 
experimental  result  which  requires  A'  =  10  ~4.  For  this  to  be  satisfied  we  need 
—  10  which  is  several  orders  of  magnitude  smaller  than  obtained  from  laser  beam 
experiment. 


4.5  Electron-neutron  and  two-neutron  bound  states 


Neutron  and  electron  as  well  as  two  neutrons  can  form  bound  states  when  their  spins 
are  parallel  so  that  the  force  between  them  is  attractive.  The  Dirac  equation  for  an 
electron  whose  spin  interacts  with  that  of  a  neutron  can  be  written  as, 


a  p  +  pm  —  (  \  1^  =  0, 


(84) 


where 


1  = 


O' 


0 


and  S  = 


O' 


0 


are  spins  of  electron  and  neutron  respectively.  The  latter  has  been  taken  to  be  infinitely 
heavy  compared  to  the  electron.  After  separating  the  angular  and  radial  part  of  i ji  in  the 
conventional  manner  (Akhiezer  and  Berestetskii  1965) 


^ jlM  ~ 


(  ?nn"“!n!V-2'-';"-r 


One  gets  the  radial  equations: 


dr  +('-Z^)F  +  (E-m  +  (3/2g)2—  |G  =  0. 


(85) 


(86) 


(87) 


Since 


a-S=l[J(d  +  l)  —  S(S+  1)  — <t(<t+  1)] 
=  l(J(J'+l)-3/2), 


(88) 


where  J  =  a  +  S;  a*S  =  1/4  for  J  —  1  and  a  S  =  -  3/4  for  J  =  0  it  is  easy  to  see  that 
bound  state  can  be  formed  when  spins  are  parallel.  We  shall  obtain  solution  of  (71)  for 
J  =  1  in  which  case  a  S  =  1/4.  The  allowed  energies  are, 


E„  =  m 


—  m 
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1  + 

1  r 

(3A?)4 


(3/2g)2/16 


(3/2  gf 


16 
(3/2  gf 


-f  nr 


1/2 


(89) 


n 


3 

4 


(90) 


32m"  512m4  \l+  1/2 

On  account  of  the  smallness  of  g2  one  can  ignore  the  last  term  in  which  case  the  ground 
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state  binding  energy  is  given  by 

B.E.  ~  81m#4/16  x  32,  (91) 

which  is  the  Bohr  formula.  If  we  put  <xg  =  g2/4n  ~  10“ 13  this  works  out  to  be 
~  10" 21  eV  and  the  Bohr  radius  7  x  102  cm.  For  the  dineutron  one  gets  for  the  ground 
state  binding  energy 


B.E. 


81  Mg4 
16x64 


10“ 18  eV 


(92) 


and  a0  =  (8  x  4)/(9  x  g2  x  M)  ~  10“ 1  cm.  These  bound  states  cannot  exist  under 
ordinary  conditions  of  temperature  and  pressure  because  they  will  be  easily  ionized  by 
the  thermal  energy  and  collisions.  They  can  exist  at  very  low  temperatures  and 
pressures. 


4.6  Aharonov-Bohm  type  of  experiment 

Aharonov  and  Bohm  (1959)  discussed  the  question  of  the  effect  of  electromagnetic 
potential  on  the  phase  of  the  quantum  mechanical  wavefunction  of  the  electron  and 
their  conjecture  has  been  experimentally  confirmed  (Chambers  1960).  Such  effect  on 
the  quantum  mechanical  phase  of  the  neutron  by  earth’s  rotation  have  recently  been 
observed  (Werner  et  al  1979).  We  shall  in  this  section,  discuss  the  effect  of  the  axial 
photon  potential  on  the  quantum  mechanical  phase  of  the  electron  and  propose  an 
Aharonov-Bohm  type  of  experiment  to  detect  this  effect.  Its  positive  result  would  be 
definite  evidence  for  the  existence  of  the  fifth  interaction. 

In  the  case  of  the  electromagnetic  local  gauge  group,  the  covariant  derivative, 


D^  =  (d,  +  ieA^  (93) 

suggest  that  the  phase  of  the  wavefunction  in  the  presence  of  the  potential  A ^  would  be 


\p(x)  =  [exp(ie 


(94) 


where  ip0(x)  is  the  wavefunction  in  the  absence  of  the  potential.  From  the  covariant 
derivative  for  the  fifth  interaction: 


D =  (d„  +  3/2(i)20y5a|1)i/' 
it  follows  that, 

3  r 

vHx)  =  |exp  +yi)gy5 


dx  a 

/i  n 


(95) 

(96) 


In  the  Aharonov-Bohm  experiment  electrons  from  a  point  source  are  made  to  pass 
through  a  double  slit  so  as  to  form  an  interference  pattern  on  a  screen.  A  solenoid  placed 
between  the  slits  and  the  screen  which  produces  no  magnetic  field  but  only  a  vector 
potential  outside,  causes  a  displacement  of  the  fringe  system  since  the  potential  brings 
about  a  phase  difference  t]  given  by, 


tj  —  e 


r 

dSB 


%J 


(97) 


between  the  two  beams.  In  our  case  we  replace  the  solenoid  by  a  long  straight  selfoc 
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fibre  through  which  a  circularly  polarized  laser  beam  is  passing.  As  discussed  in  §3.3 
this  fibre  would  produce  an  axial  electric  field  £  and  an  axial  vector  potential  outside.  It 
would  therefore  cause  a  fringe-shift  on  account  of  phase  change 


n  =  (30) 


dsc. 


(98) 


j 


T  herefore  the  occurrence  of  a  fringe-shift  on  passing  circularly  polarized  light  through 
the  selfoc  fibre  placed  between  slits  and  the  screen  would  be  a  conclusive  evidence  for 
the  existence  of  the  fifth  interaction.  Further,  quantitative  measurement  of  this  fringe- 
shift  would  lead  to  a  determination  of  the  strength  of  this  interaction. 


5.  Concluding  remark 

The  introduction  of  the  fifth  interaction  has  been  motivated  by  a  gauge  theory 
approach.  If  we  believe  in  the  philosophy  that  all  the  fundamental  interactions  are 
dictated  and  determined  by  gauge  principle  according  to  which  the  conserved  Noether 
current  of  a  global  symmetry  couples  to  the  gauge  field  which  results  from  the 
corresponding  local  gauge  symmetry,  it  then  follows  that  spin  angular  momentum 
must  couple  to  an  axial  vector  gauge  field  thereby  giving  rise  to  the  fifth  interaction.  We 
have  discussed  two  experimental  evidences  for  the  existence  of  such  an  interaction  and 
proposed  an  Aharonov-Bohm  type  of  experiment.  It  is  found  that  the  interaction  is  very 
weak  ocg  ~  10“ 13.  This  is  why  one  does  not  easily  perceive  its  effects  in  quantum 
electrodynamic  phenomena. 

The  confirmation  of  the  fifth  interaction  would  have  far  reaching  consequences.  It 
promises  to  solve  the  problem  of  occurrence  of  the  divergence  in  quantum  elec¬ 
trodynamics.  It  would  make  the  classical  electron  stable  by  providing  the  “Poincare 
stress”.  And  last  of  all  it  provides  a  dynamical  method  of  the  measurement  of  spin  in  the 
same  sense  that  photon  provides  a  measurement  of  the  charge.  The  superweak  nature 
of  the  interaction  indicates  that  it  will  play  a  major  role  in  very  long  wavelength  and 
low-energy  domain  and  very  long  distant  future  of  the  expanding  universe. 


The  authors  would  like  to  acknowledge  fruitful  discussions  with  the  members  of  the 
theory  group  of  the  Institute  of  Physics,  Bhubaneswar. 


References 

Akhiezer  A  I  and  Berestetskii  V  B  1965  Quantum  electrodynamics  p.  144 
Aharonov  Y  and  Bohm  D  1959  Phys.  Rev.  115  485 

Bethe  H  A  and  Saipeter  E  E  1957  Handbuch  der  Physik  Atome  I  (Berlin:  Springer  Verlag) 

Callan  C  G  1970  Phys.  Rev.  D2  154 

Carmeii  Moshe  1982  Classical  fields  ( general  relativity  and  gauge  theory)  (New  York:  Wiley  Interscience) 
Chambers  R  G  1960  Phys.  Rev.  Lett.  5  3 
Cho  Y  M  1976  Phys.  Rev.  D14  2521 

Fritzsch  H,  Gel-Mann  M  and  Leutwyler  H  1973  Phys.  Lett.  B47  365 


94 


T  Pradhan,  R  P  Malik  and  P  C  Naik 


Glashow  S  L  1980  Rev.  Mod.  Phys.  52  539 
Gell-Mann  M  and  Low  F  F  1954  Phys.  Rev.  95  1300 
Hayward  J  1979  Phys.  Rev.  D20  3039 
Kibble  T  W  B  1961  J.  Math.  Phys.  2  212 

Naik  P  C  1980  Quantum  electrodynamics  with  axial  photon;  Utkal  Univ.  Ph.D.  thesis 
Naik  P  C  and  Pradhan  T  1981  J.  Phys.  A14  2795 
Pradhan  T  and  Lahiri  A  1974  Phys.  Rev.  DIO  1872 
Salam  A  1980  Rev.  Mod.  Phys.  52  525 

Sciama  D  W  1962  in  Recent  developments  in  general  theory  of  relativity  (Oxford:  Pergamon  Press)  p.  415 

Stueckelberg  E  C  G  1938  Hilv.  Phys.  Acta.  11  289 

Symanzik  K  1970  Commun.  Math.  Phys.  18  227 

Tam  A  C  and  Happer  W  1977  Phys.  Rev.  Lett.  38  278 

Tam  A  C  and  Happer  W  1977  in  Springer  series  in  optical  sciences,  Laser  spectroscopy  III.  (eds)  J  L  Hall  and 
C  L  Carlsten  (Berlin:  Springer  Verlag)  p.  335 
Utiyama  R  1956  Phys.  Rev.  101  1597 
Weinberg  S  1980  Rev.  Mod.  Phys.  52  510 

Werner  S  A,  Standenmann  J  L  and  Colella  R  1979  Phys.  Rev.  Lett.  42  1103 


Pramana,  Vol.  24,  Nos  1  &  2,  January  &  February  1985,  pp.  95-108.  ©  Printed  in  India. 


Bethe-Salpeter  dynamics  for  two-photon  processes* 
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Abstract.  Recent  experimental  data  on  single  hadron  production  by  two-photon  beams  in 
petra  and  pep  have  provided  a  unique  opportunity  for  testing  specific  models  of  confinement 
through  a  study  of  one  of  their  cleanest  predictions  viz  the  yy  -*  H  amplitudes.  Motivated  by 
this  new  facility,  a  QCD-oriented  Bethe-Salpeter  model  of  harmonic  confinement,  which  has 
already  been  found  to  describe  rather  well  several  classes  of  hadronic  data  (from  mass  spectra 
to  electromagnetic  and  pionic  couplings),  is  now  employed  for  a  detailed  comparison  of  its 
predictions  on  P  -*■  yy  and  T->yy  couplings  with  the  data.  The  agreement  is  quite  good  for  all 
cases  except  one  (7  yy). 

Keywords.  Bethe-Salpeter  dynamics;  two  photon  processes 

PACS  No.  11 10;  12  35;  13  40. 


1.  Introduction  and  summary 

Two-photon  processes,  especially  those  involving  Hyy  ( H  =  qq  hadron)  vertices,  offer  a 
unique  probe  into  the  mechanism  of  coupling  of  hadrons  (as  quark  composites)  to 
photons,  and  hence  of  the  underlying  dynamics  for  the  quark  structure  of  single 
hadrons,  unmixed  with  unwanted  hadronic  interactions.  The  simplest  processes  of  this 
kind  are  of  course  H  -*■  yy  decays,  but  except  for  n°  -*■  yy,  these  have  not,  until  recently, 
been  amenable  to  direct  experimentation  with  any  degree  of  reliability,  in  view  of  the 
huge  hadronic  background  generally  involved  in  the  final  state  when  vertices  of  order  e2 
are  sought  to  be  extracted  from  standard  hadron-hadron  collision  data.  Theoretical 
tools  for  a  deeper  understanding  of  H  ->  yy  events  have  for  a  long  time  been  limited  to 
generalities  like  sum  rules  for  P  —>  yy ,  based  on  the  Adler- Bell-Jackiw  anomaly  (Adler 
1969;  Bell  and  Jackiw  1969),  or  at  most  vmd  oriented  models  (Schwinger  1967),  but  with 
little  prospects  for  more  meaningful  comparison  with  concrete  dynamical  models 
based  on  quark  compositeness. 

The  spectacular  developments  of  e+  e~  Collider  techniques  at  petra  and  pep  in  this 
very  decade  have  let  to  powerful  photon  beams  with  high  two-photon  luminosities 
which  compete  with  e+  e~  luminosities  at  high  enough  energies.  Further  the  yy  cross- 
sections  for  hadron  production  which  increase  like  (In  E )2,  soon  dominate  over  cross- 
sections  (which  decrease  like  E  ~  2),  despite  the  disadvantage  of  the  e.m.  couplings  (order 
a4)  compared  to  their  e+e~  counterparts  (order  a2)  (Kolankski  1984).  On  the  other 
hand,  most  of  the  y-y  events  have  low  invariant  masses  of  the  produced  hadrons  due  to 


*  This  paper  is  offered  as  a  “Festschrift”  in  honour  of  Dr  Raja  Ramanna  on  the  occasion  of  his  sixtieth 
birthday.  The  subject  is  theoretical  in  content  but  seeks  to  exploit  an  entirely  new  window  opened  by  the 
latest  experimental  technology  (on  two-photon  physics).  As  such  it  is  appropriately  dedicated  to  one  of  the 
main  architects  of  nuclear  science  in  India. 
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the  characteristic  bremstrahlung  spectrum  ( ~  E~ 1 )  of  the  radiated  photons  (Kolankski 
1984).  Thus  the  y-y  collision  mechanism  facilitates,  among  other  things,  a  clean 
measurement  of  yyH  vertex  functions  for  relatively  low  mass  hadrons  ( H ),  say  up  to  a 
few  GeV.  This  is  mainly  because  of  the  stringent  selection  rules  governing  the  yy 
couplings  of  mesons,  which  greatly  restrict  the  number  and  variety  of  final  states 
involved  in  y-y  inelastic  processes,  thus  greatly  simplifying  the  theoretical  analysis  of  the 
process.  In  particular,  yy  couplings  of  pseudoscalar  ( P ),  tensor  ( T )  and  scalar  ( S )  mesons 
can  be  ideally  probed  by  two  photon  production  of  these  singlet  and  triplet  qq  states 
respectively.  This  way  of  studying  yyH  coupling  through  the  yy  H  process  merely 
amounts  to  interchanging  the  roles  of  the  initial  and  final  states  compared  to  the  old- 
fashioned  way  of  looking  at  the  same  coupling  through  a  decay  process  ( H  ->  yy)  but 
now  with  the  advantage  of  a  negligible  hadronic  background. 

By  this  new  method,  the  results  of ‘measurement’  of  several  H  ->  yy  couplings  ( H  —  T, 
P,  S)  have  already  become  available  (Kolankski  1984)  thus  making  it  worthwhile  to 
compare  these  relatively  clean  data  with  the  predictions  of  specific  (QCD-oriented) 
models  of  quark  dynamics.  Further,  such  processes,  involving  as  they  do  invariant 
masses  of  modest  magnitudes  (^few  GeV),  bear  not  so  much  on  the  perturbative 
aspects  of  qcd  (corresponding  to  the  asymptotic  freedom  region),  as  on  its  (theoreti¬ 
cally  less  tractable)  confinement  aspects.  This  twin  feature  of  an  emphasis  on  the 
confinement  region  on  the  one  hand  and  a  relative  theoretical  simplicity  on  the  other 
makes  a  yy  -*■  H  process  an  ideal  laboratory  for  testing  specific  models  of  confinement. 

In  this  paper  we  have  been  motivated  by  precisely  such  considerations  to  study  yyH 
couplings  within  the  framework  of  a  QCD-oriented  Bethe-Salpeter  (as)  model  (Mitra 
and  Santhanam  1981a)  hadronic  confinement  which  has  been  developed  and  tested  in 
recent  years  through  diverse  applications  from  hadron  mass  spectra  (Mitra  and 
Santhanam  1981b;  Kulshreshtha  et  al  1982;  Mitra  and  Mittal  1984)  to  their  e.m.  and 
pionic  form  factors  (Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and  Mitra  1983; 
Mitra  and  Mittal  1984).  The  only  inputs  are  the  (constituent)  quark  masses  (rnq)  and  a 
universal  reduced  spring  constant  (d>),  which  seem  to  describe  a  fairly  large  class  of 
hadronic  data  (Mitra  and  Santhanam  1981a,b;  Kulshreshtha  et  al  1982;  Mitra  and 
Mittal  1984;  Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and  Mitra  1983;  Mitra  and 
Mittal  1984).  The  initial  calculational  techniques  for  the  evaluation  of  matrix  elements 
of  e.m.  and  hadronic  currents  (Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and  Mitra 
1982)  by  the  method  of  null-plane  variables  (Bjorken  et  al  1971;  Feldman  et  al  1973; 
Fishbane  and  Namyslowski  1980)  have  since  been  refined  and  found  not  only  to 
simplify  the  calculations  but  also  to  lend  a  degree  of  elegance  and  transparency  to  the 
algebraic  structures  of  the  matrix  elements. 

The  hadronic  matrix  elements  can  be  classified  into  H  -»yy,H  -» yH'  and  H  -*  H'H" 
types  in  ascending  order  of  successively  larger  number  of  hadrons.  The  simplest 
process,  viz  H  — ►  yy,  involves  only  one  hadron  for  which  it  is  adequate  to  consider  it  to 
be  in  its  rest  frame.  The  other  two  classes  of  processes  involve  more  hadrons  than  one, 
and  for  these  it  is  necessary  to  take  proper  account  of  their  motion  in  their  respective 
wave  functions.  While  the  latter  cases  will  be  dealt  with  in  a  subsequent  communication 
(Mitra  and  Mittal  1985),  we  concentrate  in  this  paper  on  the  specific  process  H  ->yy 
which  is  not  only  free’  from  the  complications  due  to  hadronic  motion  but  its 
predictions  are  also  directly  amenable  to  the  new  data  on  yy  -*  H  matrix  elements 
(adequately  summarized  by  Kolankski  (1984)),  thus  hopefully  providing  a  fairly  clean 
(albeit  limited)  test  of  the  dynamical  content  of  a  single  qq  wave  function. 
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In  §2  we  summarize  the  essential  features  of  the  bs  formalism  for  the  wave  functions 
of  L-excited  hadrons,  and  evaluation  of  H  -*>  yy  matrix  elements  in  terms  of  null-plane 
variables.  In  view  of  the  relatively  unfamiliar  structure  of  the  bs  matrix  elements  in  the 
null-plane  variables,  some  attention  is  paid  to  the  method  of  integration  over-the 
corresponding  time-like  internal  momentum  ( q  _ )  which  is  shown  to  exhibit  the  P  yy 
matrix  elements  in  a  simple  and  transparent  form.  Section  3  describes  the  correspon¬ 
ding  algebraic  structure  of  the  T->yy  matrix  elements  on  closely  parallel  lines,  except 
for  the  heavier  algebra  involved.  Section  4  is  devoted  to  a  comparison  of  these 
predictions  with  the  recent  yy  ->  H  data  (Kolankski  1984;  pdg  1984;  Bartel  et  al  1982). 
The  theoretical  implications  of  the  agreement  (which  is  rather  good  on  the  whole)  are 
discussed. 

2.  bs  formalism  in  null  plane  variables:  The  P-+yy  process 

The  necessary  formalism  for  the  calculations  of  various  matrix  elements  in  terms  of 
Feynman  rules  was  outlined  earlier  (Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and 
Mitra  1983).  In  particular  it  was  emphasised  that  the  key  role  is  played  by  the  four¬ 
dimensional  bs  wave  function  which  must  first  be  reconstructed  from  its  corresponding 
instantaneous  (three  dimensional)  form.  Now  the  three-dimensional  wave  functions 
have  so  far  been  worked  out  (Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and  Mitra 
1983;  Mitra  and  Mittal  1984)  in  the  hadronic  c.m.  frame  (P  =  0)  which  should  be 
adequate  for  H  -*yy  coupling,  though  not  for  HH'y  and  HH'H "  couplings  involving 
more  than  one  hadron.  For  the  latter  cases  which  were  treated  under  the  rest  frame 
assumption  for  each  hadron  (Kulshreshtha  and  Mitra  1983;  Mitra  and  Mittal  1984) 
defect  was  sought  to  be  remedied  through  the  compensating  ansatz  of  a  Lorentz- 
invariant  adaptation  of  the  corresponding  scalar  form  factors  (Santhanam  et  al  1979), 
viz 

p(p2,  F2,  p"2)  -  .r  (p2,  p;2,  p;2). 

Such  an  adaptation  is  nevertheless  ad  hoc,  and  cannot  be  a  substitute  for  a  dynamical 
basis  for  wave  functions  of  moving  hadrons.  The  null-plane  formulation  of  the  hadronic 
wave  functions  seems  to  hold  considerable  promise  in  bridging  the  theoretical  gap 
between  rest-frame  and  moving  frame  wave  functions,  as  will  indeed  be  shown  in  a 
subsequent  communication  for  HH'y  and  HH'H"  processes  (Mitra  and  Mittal  1985).  In 
the  meantime,  for  the  process  H  -*  yy  on  hand,  it  should  be  enough  to  summarize  the 
essential  details  of  the  structure  of  the  three-dimensional  qq  wave  function  ip  in  the  null- 
plane  approximation  (npa)  vis-a-vis  its  four-dimensional  bs  form  'F.  For  any  four- 
vector  A  ,  the  null-plane  components  are  defined  as  (Bjorken  et  al  1971;  Feldman  et  al 
1973;  Fishbane  and  Namyslowski  1980). 

A  1=Al,A2;  A±  =  A0±A3.  (1) 

This  decomposition  holds  for  each  of  the  4-momenta  of  the  qq  hadron  (P„)  as  well  as 
those  of  the  individual  quark  (plfl)  and  antiquark  (-p2fl)  as  shown  in  figure  1(a). 
The  roles  of  the  time-like  momenta  in  npa  are  played  by  1/2  _  ( i  =  1, 2),  instead  of  by 

pi0  in  the  instantaneous  approximation  (ia),  while  the  Z  components  in  npa  are 
effectively  pi+  instead  of  pi3  of  ia.  The  relative  ( q )  and  total  (P)  4-momenta  are  defined 
for  equal  mass  quarks  through 

Pi, 2  =  2P±q 


(2) 
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and  the  null-plane  assumption  (npa)  consists  in  integrating  out  the  bs  four-dimensional 
qq  wave  function  ¥  w.r.t.  the  variable  1/2  q_  to  obtain  an  effective  three-dimensional 
entity  \p  as  (Kulshreshtha  and  Mitra  1983). 


<A(q)  =  <A(q±,4+)  = 


id 4- ¥(/>!,  p2). 


j 


as  a  substitute  for  the  instantaneous  approximation  (ia)  which  reads  as  (Mitra  and 
Santhanam  1981a) 


<A(q)  =  <A(q±,<73)  = 


/* 


dqo'v&uPi)- 


The  inverse  npa  relation  between  \p  and  for  a  moving  hadron,  taking  account  of  its 
spin-structure,  is  expressed  by  (Mitra  and  Kulshreshtha  1982;  Kulshreshtha  and  Mitra 


1982;  Mitra  and  Mittal  1985) 

^(Pi,P2)  =  ^(-p2)r(q)Sf(Pl),  (5) 

r(q)  =  N//  r</>L  (q)D  (q)/2ni,  (6) 

D  (q)  =  2 P+  (m2q  A-  q\  - \M2)  +  2 P_  q\ .  (7) 

T  =  y5  or  iy  -e  according  as  the  qq  hadron  is  a  singlet  or  triplet  structure  respectively. 
</>  (q)  is  the  scalar  part  of  the  wave  function  which  for  an  arbitrary  L  excited  state  has  the 
form  (Mitra  and  Mittal;  Mitra  and  Sood  1977) 

<I>l  =  nLp~Lqit - qh  B[t  ,L  <j>0,  (8) 

n\  —  (M/P+)>(nfS2)~312  2L/L\  (9) 

(.0) 

P2  =  d)(Mmq)*[  1  +2cD2/Mm<?]_2.  (11) 


is  a  symmetric  traceless  rank -L  tensor  normalised  as  (Mitra  and  Sood  1977) 


Z  Btr. .  >L  (P)  ^f(M) .  (P)  =  oil : : : 


M 


(12) 


where  the  projection  operator  6 jjJ*  for  a  hadron  of  spin  J  and  4-momentum  P  has  the 
standard  definition  (Mitra  and  Sood  1977).  The  tensor  BL  fully  defines  the  relativistic 
structure  of  a  singlet  qq  state  of  J  =  L.  For  a  triplet  qq  state,  on  the  other  hand,  the  full 
spin-structure  is  expressed  by  the  CG  decomposition  in  three-dimensional  form  (Mitra 
and  Sood  1977) 


£i  ®Bfi)  =  T^+1  iL  +  i 


AL 

JAji 2  ■  ■  ■  lL 


(13) 


(  l(l+ i)  yc 

\2{2L  +  1)/  L+1 


6::  Sf- 

“i  h  ■ 


1 

. .  iLi 


of  which  only  the  first  (tensor)  state  with  unit  coefficient  corresponds  to  the  highest  J- 
value  (=  L-\- 1)  while  the  satellites  ( AL ,  SL~l )  have  more  involved  coefficients  which 
must  be  taken  into  account  for  determining  the  relative  strengths  of  various  couplings 
of  the  satellite  states  ( J  <  L+  1)  vis-a-vis  those  of  the  main  state  (J  =  La-  1).  For 
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formal  covariance,  each  index  i  in  (12)  may  be  interpreted  as  a  4- vector  as  defined  in  the 
rest  frame  (P  =  0)  of  the  hadron  through  formal  extensions  such  as 

-  %  ( py,  q,(P)  =  q,  +  qPPfXM~2.  (14) 

Substitution  of  (13)  in  (8)  gives  the  following  bs  structures  for  qq  singlet  and  triplet 
states  respectively 


r<t>L  =  nLp  Lys «P)  .  .  .  q^ ( P )  ^ 

r 4>l  =  nLp-L%qUl  •  •  •  qnT™  . .  „L  +  {ST), 

where  for  L  =  1,  the  satellite  terms  (st)  are  given  by 


(15) 

(16) 


(st)  =  nip  1  ^qlljllAl+~iy-qS0^j.  (17) 

Here  A1  and  S°  are  (L  =  1)  axial  vector  and  scalar  mesons  respectively  of  C  =  +  1, 
while  B1  is  an  axial  vector  of  C  =  —  1,  and  B°  (=  P)  is  simply  a  pseudoscalar  meson. 

The  normaliser  NH,  equation  (6),  associated  with  the  bs  vertex  T (q)  is  defined  as 
(Mitra  and  Kulshreshtha  1982) 


2  iP,  =  {2nfTv^q\fP^iy^(mq- 


<r  Pz )  -  ¥  (m,  + 


(18) 


This  equation,  when  simplified  as  in  Mitra  and  Kulshreshtha  (1982),  yields  the 
following  results  for  NH. 


(2)VH)-^M(2^{m^)  +  ^(iJj++y},  ,19) 

where  the  two  entries  refer  to  £L-type  ( J  =  L )  and  TL+1-type  (J  =  L+  1)  states 
respectively. 

In  this  paper  we  are  interested  only  in  the  yy-couplings,  figure  1  (b,  c),  for  relevant 
meson  states  (T,  B,  S )  at  rest  (P  =  0),  but  with  integrations  carried  out  in  the  null-plane 
language.  The  cases  of  immediate  physical  interest  are  several  tensor  A2,  /2)  and 

pseudoscalar  (n°9  q,  q\  qc)  mesons,  while  the  corresponding  data  on  scalar  mesons  ( S *, 


Figure  1.  a  bs  diagram  for  a  qq  hadron;  b  and  c  are  diagrams  for  H  -*  yy  process. 
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Xq)  are  not  yet  available.  To  calculate  the  matrix  elements  it  is  good  to  separate  out  the 
colour-cum-charge  factors  ( fc )  which  are  collectively  given  for  the  different  cases 

(colour  contributes  y/3  in  all  cases),  all  in  units  of  c2/x/6,  by  the  following  table 

n°(A2)  f°  f'  qc  X2 , 
fc=  1  5/3  ^2/3  4v/2/3  4^2/3.  (20) 

For  the  mixed  states  r\,  tj\  again  in  units  of  e2/v/ 6,  the  corresponding  numbers  must 
take  account  of  the  mass  difference  between  ud -  and  s-quarks  in  the  calculation  of  the 
matrix  elements  (Mitra  and  Mittal  1984).  This  requirement  is  however  met  by  writing 

the  effective  yy  coupling  constants  (without  the  factor  e2/yj6)  in  term  of  a  2  x  2  fc  matrix 
as; 


fgri)  =  l5/ 3  COS  +  ^ 

\9n'J  \  5/3  sin  (0  +  a) 


where  the  reduced  yy  coupling  constants  gud  and  gs  are  calculated  from  figure  1  (b,  c) 
assuming  only  one  type  of  quark  (ud  or  s )  emitting  the  photons.  Here  (Mitra  and  Mittal 
1984) 

tana  =  y/2;  0  —  -  10-8°.  (22) 


Such  effective  coupling  constants  may  be  formally  defined  through  a  corresponding 
Lagrangian  which  may  be  written  for  a  P  -+  yy  process  as  (Kulshreshtha  and  Mitra 
1983) 


0eir  <w„  1 42)  P„QJ4n2M, 


(23) 


where  2 =  klfl  —k2fl.  The  quantity  may  be  identified  by  comparison  with  the 
matrix  element  for  P -*  yy  via  figure  1  (b,c),  viz 


f  e2  ^ 

M(P  -+  yy)  =  Tr 
^6 


d4q[_'f(Pi,P2)iyei')SF(q-Q)iye{2)  +  (1  ^2)],  (24) 


where  T  is  the  bs  amplitude  for  the  pseudoscalar  meson.  Substitution  of  (5)  to  (1 1)  and 
(15)  with  L  =  0  in  (24)  yields: 


1  1 
a3+  +aJ 

where 

Ai,2  =  m2q+pi_  2, 

A*  =  m2  +  (q+  Q)2, 


Geff 

4n2 


=  (nP2)~"fcNH 


4Mmq  D(p o 
A1A2  2ni 


(25) 

(26) 
(27) 


and  fc  is  the  charge-colour  factor  as  listed  in  (20)  and  (21). 

The  rest  of  the  section  is  concerned  with  our  procedure  for  integration  of  (25)  w.r.t. 
the  null-plane  variables  which  will  also  hold  in  identical  form  for  the  T  -*■  yy  matrix 
elements  whose  details  are  given  in  the  next  section.  In  view  of  (2)  the  q_  integration  in 
(25)  may  be  expressed  in  terms  of  either  px  _  or  p2  _  whose  pole  position  are  governed  by 
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the  three  denominators  A  x ,  A2 ,  A  3 .  For  definiteness  we  concentrate  on  the  p2  _  -  variable 
which  has  a  pole  in  A2,  below  the  real  axis,  viz 

Pi-  =  (mi  +  9  “  h-)/Pi  +  =  (a>l-ie)/p2+.  (28) 

Since  pl±  =  P±  —  p2±  the  corresponding  p2 —-pole  arising  from  Aj,  w'z 

Pl-  =  P-  —  (Ct>x  — *fi)/Pl  + 

lies  aficwe  the  real  axis,  and  would  not  contribute  if  the  semi-circle  is  closed  from  below. 
As  for  A1,  their  p2-~ poles  are  respectively: 

Pi-  =  ±P-TQ- -(ojl-ie)/(q++Q  +  ),  (29) 

r 

where 

kU2=iP±Q,  2Q±=kl±-k2±.  (30) 

Thus  the  conditions  for  these  poles  to  be  below  the  real  axis  are  (q+  +  Q  +  )  <  0 
respectively.  (Note  that  P±  =  M  and  Q+  =  —Q-  in  the  rest  frame  of  the  hadron). 

With  these  restrictions  on  the  limits  of  the  (subsequent)  q  +  -integration,  the  various 
residues  from  available  poles  can  be  evaluated  in  a  straightforward  manner  and  the 
result  collected.  Considerable  simplification  is  achieved  by  noting  the  following  result 

Res  ofuy at  A2-pole=z )’ 

and  using  the  following  definitions 

2^1+  —  +  /?i  +  ki  _  (q  —  (2)  +  > 

2+  =  k2  +  a>l-p2  +  k2_  (q-Q)+, 

2^1  -  =  k2  +  ®i  4"  Pi  +  ^2  -  (9  +  S)+  > 

^D2_  =  +  O}]^  —  p2  +  kl_  (q  +  Q)  + . 

The  result  for 


(31) 


(32) 


is 


D  1 
2ni  Ai  A2 


2p1+  +  2(q+-Q  +  )D 


D 


2  + 


01  +  + 


q+  —  Q  +  <0 


D  2- 

which  simplifies  to 


2p2  +  _j_2(^f+  +  2+  )0 


D\-D2- 


q++Q+  <0, 


2p 


1  + 


01 


|  2p2  -f 

<?+  <  Cf  ^2  + 


>  c+ 


(33) 


+ 


2pi 


+ 


+ 


2p 


2  + 


4  +  <  -C- 


D 


(34) 


q+  >  -Qi 


An  identical  result  would  have  been  obtained  if  the  integration  were  performed  over  the 
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--variable.  To  further  simplify  the  expression  for  ,/2,  take  Q+  >  0  (without  loss  of 
generality).  Since  Q  +  =  {M,  one  has /q  _  =  k2+  =0  ;k1+  =  k2~  —  M.  Then  the  second 
and  third  terms  in  (34)  will  make  negligible  contributions  on  q  +  -integration,  while  the 
first  and  the  fourth  simplify  just  to: 


P  i  +  P  2  + 
Mcol  Mcol 


(35) 


Substitution  of  (35)  in  (25)  yields 

£eff  =  {nP2)-h6n2MmqfcNH 
which  simplifies  to 


d  Lq 


co2± 


dq+exp(-$q2p  2)  (36) 


j-ie. 


=  4m?  (rf+ifi2)-*  \~p2-)  fJo(yo)Erf(x  o), 


(37) 


where 


l(y0)  =  n 

m. 

*00 

x"  dx  exp  ( —  x)/ (x  -1-  y0 ), 

0 

(38) 

y0  =  WqP  2.  xo  =  \Q+\l\fiP- 

The  P  -*■  yy  decay  width  is  finally  given  by 

(39) 

r  (P-*  yy)  = 

l3efrl2a2/c2M(384rc3) 

(40) 

3.  Structure  of  the  T-+yy  amplitude 


To  evaluate  the  yy  couplings  of  tensor  mesons,  the  tensor  structure,  (16),  must  be 
substituted  in  (6)  for  T  (q).  The  result  is 


&  (T-*  yy) 


AjA2  2ni 


p 


+ 


At 


(41) 


where  A1>2  and  Aj  are  given  by  (26),  (27)  and 


T±  =  jTr  (mq  +  iyp2)iy£(mt—iyp1)r±,  (42) 

T+  =  iye,1)[m4  —  iy(4— G)]iy-fi(2> 

T _  =  iy  ■  e,2>  [m,  —  iy  ■  (q  +  (?)]  iy  -£a\  (43) 

4  =  ii(P)7w,n  (44) 

The  trace  evaluation  is  considerably  simplified  by  observing  that  each  of  the  quantities 
T+  can  be  effectively  replaced  by  1  /2  ( T+  +  T_ ),  since  the  difference  (T+  —  T_ )  vanishes 
on  integration  w.r.t.  q.  The  symmetric  part  is  further  simplified  in  three-dimensional 
tensor  notation  (in  the  rest  frame  of  the  hadron)  as 


2  (TV  +T_)  =  -{iM2  +  CD21  +  q2+)2AijqiqlTjl 

+  2qiTlm qm  (2Aijqi q}  -  m2  e(l 1  •  e(2>), 


(45) 
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where 

2A(j  =  eS1)eJ2)  +  fij2)e51)  —  0ije(1)-£(2) 

du  =  di<-Q,Qj> 


(46) 

(47) 


0^  is  a  two-dimensional  projection  operator  appropriate  to  the  transverse  character  of 
each  photon,  and  satisfies  the  relations 


■A-ij  ®im 


(48) 


Equation  (45)  can  be  recast  in  terms  of  the  two  basic  invariants 

R^2AijTij;  S  =  QiTijQj  (49) 

representing  the  respective  couplings  of  helicity  two  and  zero  states  of  the  yy  system  to 
the  (tensor)  hadron.  This  reduction  is  faciliated  by  an  angular  integration  over  the 
q1-vector,  noting  that  the  denominator  after  -integration  in  (41)  will  involve  only 
co2^  vide  (35).  The  main  formulae  are 

<  >  =  Oimiql  +  qlQiQm,  (5°) 

2Aijf,m  <  >  =  (i q2i)2R-  (51) 

After  the  -integration  in  (41)  for  which  the  result  (35)  for  (33)  can  be  directly  taken 
over,  the  integrals  over  q±  and  q  +  are  again  similar  to  those  encountered  in  (36).  The 
final  result  for  (41),  taking  account  of  (42)-(51),  is 


2  \  A 


K+!£2)-2 


xl-Rp2  J,  (P2  +  m2q  +  i M2)  +  2 ml P2S(JX-  J0)],  (52) 


where  Jn(n  =  0, 1)  are  as  defined  in  (38)  with  the  same  argument  as  y0  given  by  (39). 
(Since  tensor  mesons  are  all  heavy,  the  q+ -integration  has  been  taken  as  effectively 
extending  to  oo,  thus  making  Erf  (x0)  %  1:).  Expressing  (52)  in  the  form: 


3E(T^yy)  =  AR  +  BS 
the  T-^yy  width  is  given  by 


(53) 


r«=  £  \^(T->  yy)\2  |Q|/807iM2 

all  poll 


|g| 

1671 M2 


(54) 

(55) 


4.  Discussion  and  conclusions 

With  the  help  of  the  formulae  (40)  and  (55)  we  are  in  a  position  to  predict  the  yy  widths 
of  P-  and  T-  mesons  respectively.  The  basic  inputs  for  this  purpose  continue  to  be  the 
same  (Mitra  and  Mittal  1984)  as  those  employed  for  several  other  successful  predictions 
of  the  bs  model,  viz 

mud  =  0*30,  ms  = 


0-40,  mc  —  1-66,  cb  —  0T4 


(56) 
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all  in  GeV  units,  while  the  masses  of  the  concerned  hadrons  may  be  taken  either  from  the 
predictions  of  the  bs  model  (Mitra  and  Mittal  1984)  or  from  the  pdg  (1984)  tables, 
which  are  close  enough  to  each  other  so  as  not  to  cause  discrepancies  in  their  yy-width 
predictions.  For  definiteness,  the  mass  values  have  been  taken  from  the  pdg  tables  and 
the  results  for  the  yy  widths  are  listed  in  table  1,  together  with  their  recent  data  (pdg 
1984;  Althoff  et  al  1983;  Edwards  et  al  1982;  Weinstein  et  al  1983;  Roussarie  et  al  1981; 
Jenni  et  al  1983;  Behrend  et  al  1982,  1983;  Bartel  et  al  1982). 

F-yy  couplings 

We  start  by  discussing  the  yy  couplings  of  tensor  mesons  which  are  believed  to  be  free 
from  theoretical  problems  (such  as  anomalies).  As  seen  from  table  1,  the  agreement 
with  available  data  is  remarkably  close,  except  for  the  Xi  yy- 

For  the  light  quark  mesons,  the  main  question  centres  on  what  theoretical 
significance  would  be  attached  to  this  impressive  agreement.  Since  the  calculations  have 
been  made  for  the  decay  for  a  hadron  into  two  (real)  photons,  and  the  corresponding 
“data”  are  extracted  from  the  (yy)  cross-section  for  the  production  of  the  hadron  by  two 
virtual  photons,  the  theoretical  and  experimental  conditions  are  not  quite  the  same, 
thus  tending  to  cast  doubts  on  the  genuineness  of  the  agreement.  While  referring  the 
interested  reader  to  Kolankski  (1984)  for  details  of  the  analysis  of  yy  -►  H  data,  we 
should  nevertheless  like  to  point  out  some  salient  features.  Thus  the  nearer  the 
experimental  conditions  are  for  almost  real  photons  (/cf,  k\  «  0)  the  better  is  their 
overlap  with  our  theoretical  premises.  For  small  k2,  it  is  quite  reasonable  to  assume  the 
predominance  of  the  helicity-two  (2  =  2)  amplitude  over  the  lower  helicity  (2  =  0,  1) 

Table  1.  H  -+yy  widths  (in  keV)  for  different  mesons  ( H  =  P,  T).  The  experimental  data 

listed  are  from  the  sources  indicated  in  the  last  column. 


Process 

r  (theor) 

yy 

T  (expt.) 

yy 

Expt.  references 

t 

o 

2-20 

2-95  ±0-30  +  0-40  <  AV  > 

Althoff  et  al  (1983) 

Brandelik  et  al  (1981) 
Edwards  et  al  (1982) 
Weinstein  et  al  (1983) 
Dainton  (1983) 

Brighton  (1983) 

A2-+yy 

0-74 

0-79  ±0-23  ±0-25  <  AV  > 

Edwards  et  al  (1982) 
Weinstein  et  al  (1983) 
Behrend  et  al  (1982) 

/'-►yy 

0-134 

0-11  ±0-02  ±0-04 

Althoff  et  al  (1983) 

Brandelik  et  al  (1981) 

h  — ►  yy 

0-178 

1-8  ±1-3  (prelim) 

Quoted  in  Kolankski  (1984) 

n°  ->  yy 

6  61  x  10"3 

(7-85  ±0-54)  x  10" 3 

pdg  (1984) 

n-*yy 

0-060 

0-324 ±0-05  AV;  0-56±0-1215 

pdg  (1984) 

Edwards  et  al  (1982) 
Weinstein  et  al  (1983) 

n’  -*yy 

4-59 

5-5 ±0-7  AV;  3-8 ±0-6 

pdg  (1984) 

Althoff  et  al  (1983) 

Brandelik  et  al  (1981) 

nc->yy 

2-18 

6-2  ±0-5 

Bloom  et  al  (1983) 

n i->yy 

011 

0-4  ±? 

Bloom  et  al  (1983) 
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modes,  an  assumption  which  has  generally  been  made  for  the  extraction  of  T  -*  yy 
couplings  from  the  “raw”  data.  This  assumption  is  fully  borne  out  in  the  bs  model  where 
each  photon  is  real  (kf  =  0),  since  an  overwhelming  contribution  (90-95  %)  comes  from 
the  R- term  (A  =  2)  and  only  a  small  amount  (  <  10%)  from  the  5-term  (2  =  0). 
However,  to  the  extent  that  the  experimental  conditions  differ  from  k2  =  0,  un¬ 
certainties  in  the  estimates  cannot  be  ruled  out.  For  example,  the  tasso  collaboration 
(Althoff  et  al  1983;  Brandelik  et  al  1981)  studied  the  k 2 -dependence  of  the  f°  yy  coupling 
by  keeping  one  of  the  photon  momenta  fixed  at  k2  —  0-35  (the  “tagged”  photon)  and 
found  a  suppression  of  as  much  as  50  %  relative  to  the  “no-tag”  result.  Our  theoretical 
premises  for  f°  — ►  yy  seem  to  be  more  in  harmony  with  the  “no-tag”  experimental 
conditions,  for  which  the  T  -*•  yy  results  are  listed  in  table  1. 

Another  source  of  uncertainty  concerns  the  resonant  vs  non-resonant  contribution 
to  the  specific  yy  cross-section  from  which  the  H  -+yy  couplings  are  extracted.  For 
example,  the  nn  channel  which  is  dominant  for  f°yy  coupling  has  a  large  non-resonant 
background  in  the  n  f  n~  case  (tasso)  (Althoff  et  al  1983;  Brandelik  et  al  1981),  and  a 
smaller  one  in  the  n0n0  state  (Edwards  et  al  1982;  Brandelik  et  al  1981),  and  these 
features  require  a  modest  model  dependence  of  the  corresponding  data  analysis. 
Similarly  the  A2  -*  yy  couplings  have  been  measured  through  both  the  decay  modes  qn0 
(Edwards  et  al  1982;  Weinstein  et  al  1983),  and  pn  (Behrend  ef  <2/ 1982, 1983),  again  with 
the  helicity-two  assumption,  but  with  suitable  parametrizations  of  the  (modest) 
background  effects.  Similar  remarks  apply  to  /'  — >  yy. 

P  -+  yy  couplings 

The  traditional  example  of  the  P  ->yy  coupling  is  yy  decay  of  a  n°  which  has  not  only 
played  a  central  role  in  establishing  the  colour  degree  of  freedom  of  quarks,  but  also 
acted  as  a  laboratory  for  testing  the  twin  concepts  of  pcac  and  the  axial  vector  anomaly 
associated  with  the  so-called  triangle  diagram  (Adler  1969;  Bell  1969).  In  this  respect  we 
recall  the  original  reasoning  of  Adler  (1969)  that  the  triangle  (anomaly)  diagram 
provides  the  “driving”  matrix  element  for  determining  the  n°  — ►  yy  amplitude  by 
interpreting  it  as  the  vacuum  -*  yy  transition  of  the  pcac  term  m2(pn  in  the  anomaly- 
modified  relation  (Alder  1969;  Bell  1969)  for  dj*  and  demanding  that  the  latter  should 
vanish  in  the  limit  (k1  +/c2)2  =  0.  Thus  the  Alder  reasoning  provides  a  much-needed 
handle  on  the  calculation  of  n°yy  coupling  at  the  physical  n°  mass  (kx  -I-  k2)2  —  —  m2,  in 
terms  of  the  Ward  identity  requirement  that  the  n°  ->  yy  amplitude  should  vanish  at  (/cx 
+  k2)2  =  0,  but  such  a  handle  presupposes  that  n°  is  an  elementary  field  (Goldstone 
boson).  On  the  other  hand,  in  the  standard  quark-picture,  the  pion  is  just  as  much  of  a 
qq  composite  as  any  other  meson,  and  figures  1  (b,  c)  tell  precisely  how  the  n°  as  a 
composite  particle  couples  to  yy,  without  any  further  need  to  invoke  the  anomaly- 
modified  pcac  relation  to  define  its  yy  coupling  at  the  physical  pion  mass  (Adler  1969; 
Bell  1969).  In  other  words  figure  1  (b,  c)  not  only  play  the  roles  of  the  anomaly 
diagrams  themselves  for  the  composite  pion,  but  they  also  provide  a  complete 
description  for  the  n°  -*  yy  amplitude  without  any  further  assumptions  (Adler  1969; 
Bell  1969).  Exactly  similar  considerations  hold  for  the  yy  couplings  of  the  other 
pseudoscalar  mesons.* 


*  This  point  of  view  is  somewhat  at  variance  with  the  position  adopted  in  our  earlier  paper  (Kulshreshtha 
and  Mitra  1983)  which  we  no  longer  consider  tenable.  The  integration  technique  presented  here  for  P  -+  yy  is 
also  numerically  more  accurate  than  the  one  employed  earlier. 
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It  is  against  this  background  that  a  comparison  of  the  bs  predictions  with  the  data  on 
P  ->  yy  couplings  should  be  viewed.  First  of  all  we  notice  from  (37)  to  (40)  that  the 
Af-dependence  of  T  is  Af13/4,  roughly  in  accord  with  the  phenomenological  analysis 
(Kolankski  1984)  based  on  T  ~  M3  (the  balance  of  Af-dependence  being  a  dynamical 
effect).  The  predicted  n°  ->  >7  and  g'  — ►  yy  rates,  the  latter  on  the  basis  of  the  mixing 
angle  (22),  are  seen  to  be  in  good  accord  with  the  data,  but  the  g  case  seems,  prima  facie, 
to  be  a  disaster.  This  is  particularly  so  when  it  is  remembered  that  the  rate  (0-56) 
estimated  by  the  Crystal  Ball  group  (Edwards  et  al  1982;  Weinstein  et  al  1983)  based  on 
the  analysis  of  the  reaction 

e+  e~  — ►  e+  e~  t]  -*  e+  e  "  yy  (57) 

is  even  higher  than  the  more  conventional  value  (0-324)  quoted  by  pdg  (1984)  based  on 
the  Primakoff  effect.  This  is  a  non-trivial  discrepancy  but  a  major  part  of  it  must  be 
attributed  to  the  g-g'  mixing  effect.  Indeed  as  seen  from  the  mixing  matrix,  (21),  the 
interference  is  constructive  (and  hence  relatively  insensitive)  for  r\  — ►  yy.  On  the  other 
hand,  it  is  destructive  for  r\  -►  yy,  and  hence  highly  sensitive  to  the  0-value  which  is  far 
from  ‘ideal’  in  this  pseudoscalar  case.  (For  further  discussion,  see  Kolankski  1984) 
As  for  the  heavy  quarkonium  states  rjc,  rjb,  unfortunately  the  results  of  direct 
measurements  of  the  type  (57)  are  not  yet  available.  Rather,  the  data  (Bloom  et  al  1983) 
listed  in  table  1  are  highly  indirect,  being  based  on  the  observation  (Kolankski  1984) 
that  the  wave  function  at  the  origin  should  be  the  same  for  both  (rjc)  and  iSl  (t/// J) 
states,  so  that  the  accurately  measured  widths  of  the  latter  can  be  used  to 

“determine”  the  corresponding  P~+yy  widths  through  an  essentially  geometrical 
relationship.  On  the  other  hand  it  is  possible  to  question  the  validity  of  this  assumption, 
and  indeed  an  independent  estimate  of  rjc  -+  yy  based  on  the  branching  ratio  for  if/  ->  yrjc 
yields  a  much  smaller  value,  viz  (Kolankski  1984) 

F  {gc  —*■  yy)  =  l-6±0-8keV,  (58) 

which  seems  to  be  consistent  with  our  theoretical  prediction.  (The  rate  for  rjb  -►  yy  has 
been  listed  only  for  illustration,  since  in  any  case  the  bs  model  for  harmonic 
confinement  cannot  be  expected  to  work  for  bb  spectroscopy). 


rjc  -►  gg  width 


We  have  attempted  another  independent  check  on  the  t]c  wave  function  through  a 
prediction  of  the  gc  ->  gg  width  as  an  ozi  forbidden  process.  The  r\c  -►  gg  coupling  can 
be  trivially  calculated  in  our  model  by  making  the  following  replacements: 


Color :  ^/3  =  —y=SH 
'  V3 


Charge : 


fc=>(4nocs), 


(59) 

(60) 


where  <as )  is  the  “strong”  coupling  constant.  Thus  the  overall  replacement  in  (40)  for 
estimating  T  (P  -►  gg)  becomes 


a 
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7(2/3)  <as>. 


(61) 
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For  a  realistic  estimate  of  <as>  appropriate  to  the  confinement  region,  the  following 
replacement  of  the  perturbative  Q  —  Q  interaction  ( =  cajr)  by  the  harmonic  interaction 


M') 

r 


-22 
-m,a>  r 

2  c 


should  be  in  order,  thus  leading  to  the  determination 


(62) 


<as>  =  t rnc of  <r3  > 


(63) 


where  <r3 )  must  be  calculated  for  the  QQ  state  under  study. 

This  gives 

<as  >  =  3 mc(b2/p3  Jn.  (64) 

Substitution  of  (61)  and  (64)  in  (40)  yields  the  not  unreasonable  estimate 

r  (r}c  -*  gg)  =  4-66  MeV,  (65) 

which  compares  favourably  with  the  values  (1 1-5  ±  5;  8-3  ±  -5)  quoted  by  Bloom  et  al 
(1983).  On  the  other  hand,  the  same  formula  (62)  yields  for  a  typical  ozi  allowed  process 
the  value 

T  (f°  -+  gg)  =  45-95  MeV.  (66) 

which  is  only  about  25%  of  the  total  /0  width  (pdg  1984). 

To  summarise,  the  overall  consistency  of  the  several  T-+yy  data  with  the  predictions 
should  warrant  a  cautious  optimism  about  the  dynamical  status  of  the  qq  structure  of 
these  hadrons  in  the  bs  model  for  harmonic  confinement.  Notwithstanding  this 
conclusion,  it  need  hardly  be  emphasized  that  such  a  limited  comparison  of  the  data  at 
H  -*  yy  coupling  level,  requiring  as  it  does  a  model-dependent  data  analysis,  cannot  be  a 
substitute  for  a  more  direct  comparison  with  the  “raw”  data  available  in  the  form  of 
various  two-photon  inelastic  cross-sections  themselves.  Such  calculations  are  envisaged 
in  the  near  future. 
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Adiabatic  time-dependent  Hartree-Fock  theory  with  the 
consistency  condition 

A  K  MUKHERJEE  and  M  K  PAL 

Saha  Institute  of  Nuclear  Physics,  Calcutta  700009,  India 

Abstract.  Adiabatic  time-dependent  Hartree-Fock  (atdhf)  theory,  including  the  authors’ 
work  in  this  field,  has  been  summarised.  In  response  to  the  criticism  of  Yamamura  et  al  the  role 
of  curvature  in  preventing  the  choice  of  pure  rpa  mode  as  the  solution  near  the  static  Hartree- 
Fock  minimum  has  been  discussed. 

Keywords,  atdhf;  nuclear  fission;  Lipkin  model;  pure  rpa  mode. 

PACS  No.  21  60;  25-85 


1.  Introduction 

In  recent  years  there  has  been  an  increasing  interest  among  nuclear  theorists  in 
developing  a  microscopic  theory  for  the  description  of  nuclear  dynamics.  The  success  of 
the  mean-field  approximation  in  describing  the  static  properties  of  nuclei  as  well  as  the 
incredible  ease  of  the  Hartree-Fock  (hf)  calculation  (Vautherin  and  Brink  1973) 
attained  with  the  use  of  the  density-dependent  effective  two-body  interaction  suggests 
that  the  mean-field  approximation  could  also  be  used  to  describe  nuclear  dynamics.  The 
old  formulation  on  the  time-dependent  Hartree-Fock  (tdhf)  theory  due  to  Dirac 
(1930)  has  been  revived  (Bonche  et  al  1976)  designing  it  for  the  description  of  the  time- 
dependent  nuclear  processes,  especially  for  the  large  amplitude  collective  motions  of 
nuclei.  Sufficient  progress  has  been  made  (Negele  1982)  in  the  last  decade  in  applying 
the  tdhf  theory  for  the  description  of  heavy-ion  collision  and  encouraging  results  for 
the  deep-inelastic  scattering,  fusion  events  etc  have  been  obtained.  However,  the 
limitations  of  tdhf  in  describing  such  many-body  processes  like  nuclear  fission,  for 
which  a  description  in  terms  of  barrier  penetration  is  traditionally  employed,  have 
become  clear  (Reinhardt  1981),  the  tdhf  solution  gets  trapped  inside  the  barrier  and 
can  never  lead  to  fission  into  separated  fragments. 

An  independent  formalism,  namely,  the  adiabatic  time-dependent  Hartree-Fock 
(atdhf)  theory  has  been  formulated  (Baranger  and  Veneroni  1978;  Brink  et  al  1976; 
Villars  1977;  Moya  De  Guerra  and  Villars  1977;  Goeke  and  Reinhard  1978)  in  the  last 
few  years  especially  designed  for  those  processes  whose  dynamical  evolution  is 
governed  by  one  or  a  few  collective  variables  and  in  which  the  collective  motion  is  slow 
compared  to  the  single  particle  oscillations.  Typical  examples  of  such  process  are  the 
nuclear  fission  and  the  inverse  process,  the  sub-barrier  fusion  of  two  heavy  ions. 

Investigations  have  been  carried  out  by  several  groups  of  workers  (Reinhard  and 
Goeke  1978;  Goeke  et  al  1981;  Rowe  1982;  Rowe  and  Ryman  1982;  Moya  De  Guerra 
and  Villars  1978;  Mukherjee  and  Pal  1981, 1982a,  b)  on  whether  the  conditions  derived 
from  the  mean-field  theory  of  the  adiabatic  many-body  process  provide  a  unique 
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specification  of  the  collective  subspace.  It  has  turned  out  in  these  investigations 
(Mukherjee  and  Pal  1982a,  b;  Pal  1984)  that  the  zeroth-  and  first-order  atdhf 
equations,  called  the  Villars  equations,  supplemented  with  a  consistency  condition 
derived  from  the  second-order  atdhf  equation  specify  the  collective  path  uniquely. 
Further,  it  is  a  satisfying  result,  as  has  been  proved  analytically  by  Mukherjee  and  Pal 
(1982a)  that  the  path  followed  by  the  many-body  system  during  its  adiabatic  motion  is 
indeed  the  ‘fission  path’  which  follows  the  bottom  of  the  valley.  In  §2  a  short  review  of 
the  atdhf  theory  has  been  included. 

However  there  has  been  a  certain  criticism  due  to  Yamamura  et  al  (1984)  on  the 
results  obtained  by  Mukherjee  and  Pal  (1981)  regarding  the  non-uniqueness  of  the 
solution  of  Villars  equations.  In  the  context  of  the  three-level  Lipkin  model  it  has  been 
argued  by  Yamamura  et  al  (1984)  that  in  the  neighbourhood  of  a  critical  point  of  the 
collective  potential  the  canonicity  condition  along  with  the  rpa  boundary  condition 
restricts  the  solutions  of  the  Villars  equations  to  be  unique  and  is  given  by  the  lowest- 
frequency  principal  line.  In  §3  emphasizing  the  role  of  curvature  of  the  collective  path, 
the  ambiguity  of  the  prescription  used  by  Yamamura  et  al  (1984)  in  order  to  choose  the 
unique  path  has  been  clarified.  Concluding  remakrs  are  presented  in  §4. 


2.  The  atdhf  theory 

The  atdhf  theory  has  been  developed  primarily  by  Baranger  and  Veneroni  (bv)  (1978), 
Brink  et  al  (1976),  Villars  (1977),  Moya  De  Guerra  and  Villars  (1977). 

Baranger  and  Veneroni  (1978)  used  the  language  of  density  matrix  and  the  theory 
was  cast  in  a  form  independent  of  any  initial  parametrisation.  Using  the  tdhf  equation 
the  equations  of  motion  as  a  function  of  time  for  the  evolution  of  the  time-even  single¬ 
particle  density  matrix  and  the  corresponding  momentum  matrix  have  been  obtained; 
adiabaticity  in  this  work  implies  that  the  momentum  matrix  is  ‘small’  in  some  sense. 
Given  the  initial  values  of  the  time-even  density  and  the  momentum  matrix  the 
equations  of  motion  can  be  integrated  taking  small  time  steps.  Using  the  density  and 
the  momentum  matrix  the  existence  of  a  collective  Hamiltonian  has  been  established 
and  the  Schrodinger  equation  corresponding  to  the  classical  Hamiltonian  can  be  solved 
to  compute  any  observables  pertaining  to  the  collective  motion. 

In  the  formulation  of  Villars  and  coworkers  (Villars  1977;  Moya  De  Guerra  and 
Villars  1977)  the  theory  is  cast  in  terms  of  the  state  vectors  that  are  parametrised  right 
from  the  beginning  in  terms  of  two  parameters — one  to  play  the  role  of  collective  co¬ 
ordinate  q  and  the  other  that  of  the  collective  momentum  p.  Assuming  that  the 
momentum  p  is  small  which  assures  the  adiabaticity  the  tdhf  equation  has  been  used 
(Villars  1977)  perturbatively  in  powers  of  the  adiabaticity  parameter  p  and  the  hierarchy 
of  atdhf  equations  has  been  obtained  (Villars  1977).  The  zeroth-  and  first-order  atdhf 
equations,  usually  called  the  Villars  equations,  are  given  by 


<<5<D(<j)  H-~Q  <t>(q)  >  =  0, 

d 

<S<t>(q)\[H,iQ-]-P/m(q)\^(q)}=0. 


(I) 


(II) 


Here  P  and  Q  are  one-body  particle-hole  (p-h)  type  operators  and  the  former  is  the 
generator  of  displacement  in  the  time-even  Slater  determinantal  space  |4>(<j)>.  The 
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collective  potential  V{q)  and  the  collective  mass  m(q )  are  given  by, 

V(q)=  (®(q)\H\®(q)\  (la) 

m~\q)=  <f(«)|[[H>iQ],iG]|4>(?)>.  (lb) 

At  any  point  in  the  Slater  determinantal  space  defined  by  |  <I>( q) )  and  its  1  p- 1  h  excited 
states  l^ph  (<?)>,  (I)  is  an  equation  for  the  determination  of  the  ph-components  of  Q  in 
terms  of  those  of  the  given  Hamiltonian,  while  (II)  determines  the  ph-components  of 
the  operator  P  which  is  the  generator  of  displacement.  In  obtaining  the  time-even  state 
|0(4  +  Sq) )  at  the  neighbouring  point  with  the  help  of  the  generator  P  a  normalisation 
factor  (A/m)  is  required  which  is  obtained  by  choosing  the  collective  submanifold  (in 

this  case  a  single  collective  co-ordinate  q)  in  terms  of  a  measuring  operator  D, 

q  =  <<t>(q)\D\<t>(q)},  (2a) 

or  <®(?)|[D,-iP]|®(«)>  =  l.  (2b) 

It  is  to  be  emphasized  here  that  the  direction  in  the  ph-space  defined  by  the  operator  P  is 
independent  of  the  choice  of  this  normalisation. 

In  this  manner  starting  from  any  point  (not  a  critical  point)  in  the  Slater 
determinantal  space  a  unique  path  can  always  be  obtained.  By  varying  the  starting 
point  one  can  thus  generate  an  infinite  number  of  non-intersecting  paths  which  are 
solutions  of  the  Villars  equations  and  converge  at  the  nodal  point  of  the  potential 
function.  It  should  be  noted  here  that  at  the  critical  points  where  dV/dq  =  0  (I)  does 
not  determine  Q  and  hence  the  procedure  for  generating  a  solution  of  the  Villars 
equations  (I)  and  (II),  as  described  above,  does  not  go  through  if  a  critical  point  is 
chosen  in  practice  to  start  the  solution. 

In  order  to  explore  the  feasibility  of  obtaining  the  atdhf  path,  the  theory  has  been 
applied  (Moya  De  Guerra  and  Villars  1978;  Mukherjee  and  Pal  1981)  to  describe  a 
solvable  model,  namely,  the  three-level  Lipkin  model.  In  the  context  of  the  three-level 
model  the  Villars  equations  reduce  (Moya  De  Guerra  and  Villars  1978;  Mukherjee  and 
Pal  1981)  to  a  non-linear  first-order  ordinary  differential  equation  of  which  the  critical 
points  of  the  collective  potential  i.e.  the  minimum,  maximum,  the  saddle  point  etc  are 
the  singular  points.  All  the  solutions  of  the  differential  equation  passes  through  the  hf 
minimum  which  is  then  called  the  nodal  point.  With  the  use  of  a  property  regarding  the 
behaviour  of  the  solutions  of  the  differential  equation  near  the  nodal  point  it  has  been 
shown  (Mukherjee  and  Pal  1981)  analytically  that  at  the  hf  minimum  the  slopes  of  all 
the  solutions  including  the  valley  path  but  excluding  the  higher-frequency  principal  line 
are  the  same  as  that  of  the  lowest  frequency  rpa  mode  and  thus  the  non-uniqueness  of 
the  solution  of  the  Villars  equations  has  been  established. 

The  solutions  of  Villars  equations  are  shown  (Mukherjee  and  Pal  1982a)  to  represent 
the  lines  of  force  orthogonal  to  the  equipotential  surface  on  the  curved  space  of  the 
time-even  Slater  determinants  and  thus  the  non-uniqueness  of  the  solution  has  been 
established  in  general,  irrespective  of  the  model.  Through  any  point  in  the  space  of 
time-even  Slater  determinants  there  exists  a  line  of  force  representing  the  solution  of  the 
Villars  equations  at  that  point.  All  these  infinite  numbers  of  non-intersecting  solutions 
converge  at  the  hf  minimum  and  therefore  the  Villars  equations  with  the  boundary 
condition  that  the  solution  should  behave  like  the  lowest-frequency  rpa  mode  near  the 
hf  minimum  does  not  in  general  yield  a  unique  atdhf  path.  In  particular  in  the  case  of  a 
finite  dimensional  model  like  the  Lipkin  model  where  there  are  only  two  rpa  modes,  it  is 
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obvious  that  an  infinite  number  of  lines  of  force  representing  the  solution  of  the  Villars 

equations  ought  to  behave  at  the  nodal  point  as  either  of  the  rpa  modes. 

It  is  to  be  noted  here  that  in  the  atdhf  theory  one  is  specifically  interested  in 
extracting  a  collective  submanifold  (in  this  case  a  one-dimensional  path)  on  which  the 
dynamics  of  the  many-body  system  is  governed  by  the  Hamilton’s  equations  of  motion 
corresponding  to  a  classical  collective  Hamiltonian  defined  on  the  collective  sub¬ 
manifold.  It  is,  however,  intuitive  on  physical  grounds  as  has  been  recognized 
frequently  in  the  past  in  the  macroscopic  description  of  nuclear  fission  (Hill  and 
Wheeler  1953)  that  physical  systems  describing  adiabatic  motion  should  follow  the 
path  along  which  the  energy  is  minimised  i.e.  the  path  following  the  bottom  of  the 
valley.  The  valley  of  a  potential  function  V  is  defined  variationally  as, 

<5 { | grad  V\2}  —  ASV  —  0,  (3) 

i.e.  the  magnitude  of  the  potential  gradient  is  stationary  for  excursions  along  the 
equipotential  surfaces.  Here  A  plays  the  role  of  a  Lagrange  multiplier  taking  care  of  the 
constraint. 

In  view  of  the  above  non-uniqueness  of  the  solution  of  Villars  equations  and  the 
desire  to  explore  the  valley  path  as  the  optimal  atdhf  solution  Goeke  and  coworkers 
(Goeke  and  Reinhard  1978;  Reinhard  and  Goeke  1978;  Goeke  et  al  1980,  1981, 
1983a,  b)  proposed  a  validity  criteria  for  exploring  the  atdhf  path.  It  has  been 
argued  that  to  make  the  adiabatic  assumption  valid,  a  second-order  quantity  obtained 
from  the  second-order  atdhf  equation  should  be  small  compared  to  the  first-order 
contribution.  In  practice,  however,  they  find  (Goeke  et  al  1980, 1983a,  b)  that  the  ratio 
of  the  two  contribution  be  minimum  along  the  optimal  path.  Indeed  the  validity  of  the 
adiabatic  assumption  requires  that  the  second-order  quantity  should  be  small 
compared  to  the  first-order  contribution;  it  is  however  not  clear  from  theoretical 
grounds  why  their  ratio  should  be  minimum  for  the  optimal  solution  of  the  atdhf 
theory.  Nevertheless  in  numerical  calculations  (Goeke  et  al  1980,  1983a,  b)  they  have 
used  ‘trial  and  error’  method  for  exploring  the  optimal  atdhf  path. 

However,  to  describe  nuclear  fission,  the  theory  should  provide  in  a  consistent 
manner  the  optimal  path  as  the  lowest  valley  of  the  many-body  potential  energy 
surface.  It  has  been  established  by  Mukherjee  and  Pal  (1982a)  that  simultaneous 
solution  of  the  zeroth-,  first-  and  second-order  atdhf  equations  indeed  delineates  the 
lowest  valley  as  the  optimal  solution  of  the  atdhf  theory. 

The  formal  identity  of  the  atdhf  approaches  of  Villars  and  of  Baranger  and  Veneroni 
(bv)  has  been  established  (Mukherjee  and  Pal  1982a;  Pal  1984).  It  has  been 
demonstrated  that  while  the  bv  approach  mixes  the  zeroth-  and  second-order 
quantities,  Villars  neglects  the  second-order  quantity  altogether.  In  the  spirit  of  the 
perturbation  approach  the  second-order  equation, 

=0,  (III) 

should  be  considered  as  a  separate  equation.  It  has  been  shown  by  Mukherjee  and  Pal 
(1982a)  that  for  consistent  exploitation  of  the  time-dependent  equation  in  the  adiabatic 
limit  the  second-order  equation  (III)  should  be  considered  as  a  separate  equation  along 
with  the  zeroth-  and  first-order  Villars  equations  (I)  and  (II). 

Since,  as  has  been  noted  earlier,  the  zeroth-  and  first-order  equations  can  be  solved, 
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given  a  time-even  determinantal  state  as  the  initial  condition,  the  simultaneous 
fulfilment  of  the  second-order  equation  calls  for  a  consistency  condition, 

<S<J>(q)\iX2i[H,iQl  -iP]-iAw0(q)Q\®(q))  =  0  (III') 

derived  from  the  zeroth-  and  second-order  equations  to  be  satisfied.  Here, 

<Oo(q)  =  j^(,X2lm).  (4) 

It  is  a  satisfying  result  as  has  been  proved  by  Mukherjee  and  Pal  (1982a)  by  an 
analytic  method  (without  invoking  any  model)  that  the  consistent  solution  of  Villars 
equations  (I)  and  (II)  satisfying  the  consistency  condition  (III')  yields  a  unique  path.  In 
the  space  of  the  time-even  Slater  determinant  for  which  the  mass  matrix  plays  the  role 
of  the  metric  tensor  characterising  the  space,  the  solution  represents  the  valley  of  the 
collective  potential  satisfying  the  variational  equation  (3)  with  co0(q)  playing  the  role  of 
the  Lagrange  multiplier  A. 

It  should  be  noted  here  that  having  obtained  the  optimal  collective  path,  the  classical 
Hamiltonian  defined  on  the  optimal  path  can  be  established  and  with  the  use  of  the 
Schrodinger  equation  corresponding  to  the  classical  Hamiltonian  the  observables 
pertaining  to  fission  may  be  computed. 

However  it  is  to  be  noted  that  the  atdhf  theory  indeed  yields  the  Hamilton’s 
equation  of  motion  corresponding  to  the  classical  Hamiltonian, 


■&ip,q)  -  ,  .  +v(q) 

2m(q) 

(5) 

provided  the  canonicity  condition, 

<  ®to)|  [fi. -;P]|<I>(<?)>  =  1 

(6) 

is  satisfied. 

The  canonicity  condition  (6)  is  interpreted  (Villars  1977;  Goeke  and  Reinhard  1978; 
Reinhard  and  Goeke  1 978;  Mukherjee  and  Pal  1 982a,  b;  Goeke  et  al  1 980, 1 983a,  b)  as  a 
normalisation  condition  of  the  operators  Q  and  P.  Along  with  the  equation  (2b)  that 
specifies  the  collective  co-ordinates  in  terms  of  the  measuring  operator  D,  the 
canonicity  condition  (6)  has  been  shown  (Mukherjee  and  Pal  1982b;  Goeke  et  al  1980, 
1983a,  b)  to  yield  an  equation  for  determining  the  collective  mass  m(q)  consistent  with 
the  definition  (lb). 


3.  Non-uniqueness  in  the  Lipkin  model 

In  the  context  of  the  three-level  Lipkin  model  Yamamura  et  al  (1984)  obtained  the 
solution  of  the  linearised  Villars  equations  (I)  and  (II)  in  the  neighbourhood  of  a  critical 
point  which  are  in  essence  the  same  as  those  obtained  by  Mukherjee  and  Pal  (1981). 
Yamamura  et  al  (1984)  concluded  that  the  slopes  of  all  these  solutions  at  the  critical 
point  are  the  same  as  that  of  the  lowest  frequency  rpa  mode.  The  above  conclusion  is 
true  only  when  the  critical  point  is  a  nodal  point  characterised  by  the  fact  that  the 
square  of  the  rpa  eigenvalues  are  of  the  same  sign.  If  the  rpa  eigenvalues  (squared)  are  of 
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opposite  sign  characterising  the  saddle  point  the  principal  lines  which  designate  the  rpa 
mode  is  the  only  solution  (Mukherjee  and  Pal  1981). 

The  requirement  of  the  rpa  boundary  condition  demands  that,  , 

(d2V/dq2)q  =  qHf  =  Xi,  (7) 

where  A{  is  the  lowest  eigenvalue  of  the  rpa  matrix. 

The  canonicity  condition  of  (6)  linearised  about  the  critical  point  along  with  the 
substitution  of  (7)  and  the  afore-mentioned  solution  of  the  linearised  Villars  equations, 
can  be  written,  following  Yamamura  et  al  (1984)  as 

(A2-\)AU2  +  B2AU2fi  =  0.  (8) 

Here  A  and  B  are  constants  of  integration,  Al ,  A2  are  the  rpa  eigenvalues  (squared) 
with  /i  =  A2/Al  •  In  order  to  avoid  confusion  with  the  operator  Q  a  new  notation  £  =  q 
—  qHF  has  been  used  in  place  of  the  notation  Q  of  Yamamura  et  al  (1984). 

It  is  to  be  noted  here  that  in  deriving  (8)  the  orthonormality  of  the  rpa  eigenvectors 
with  respect  to  the  metric  M_1  (Mukherjee  and  Pal  1981)  has  been  used. 

It  has  been  argued  by  Yamamura  et  al  (1984)  that  (8)  is  satisfied  if  and  only  if, 

A2  &  1,  (9a) 

B  =  0.  (9b) 

Condition  (9)  is  a  particular  solution  of  the  linearised  Villars  equations  namely  the 
principal  line  designating  the  lo west-frequency  rpa  mode.  Yamamura  et  al  (1984) 
concluded  that  the  solution  of  the  Villars  equations  (I)  and  (II)  satisfying  the  canonicity 
condition  (Baranger  and  Veneroni  1978)  yields  a  unique  path  which  is  given  by  the 
solution  of  the  atdhf  equations  (I)  and  (II)  satisfying  the  boundary  condition  that  the 
solution  in  the  neighbourhood  of  the  critical  point  is  given  by  the  lowest-frequency 
principal  line  characterised  by  (9).  The  lacuna  in  this  argument  of  Yamamura  et  al 
(1984)  will  now  be  clarified. 

Let  us  discuss  the  linearisation  of  the  Villars  equations  (I)  and  (II)  about  the  static 
Hartree-Fock  (hf)  state.  Since  (I)  is  satisfied  for  all  values  of  q  its  derivative  with  respect 
to  q  should  also  be  satisfied, 

<  MH.4)  |  [H,  -  iP]  -  Cl (q)Q  -  X(q)  ~  |  >  =  0.  (10) 

Here  A(q)  =  dV/dq  and  C(q )  =  dA/dq.  At  the  critical  point  of  the  potential  where  A(q) 
=  0  (I)  does  not  determine,  as  noted  earlier  after  (2b),  the  p-h  components  of  Q  ;  instead 
at  that  point  it  represents  the  static  hf  condition.  In  the  neighbourhood  of  the  hf  point  if 
the  curvature  of  the  path  is  neglected  then  dQJdq  =  0  and  (10)  can  be  written  as, 

<<5<D(4)|[7/,  -iP]-C(q)Q\®(q)>  =  0.  (11) 

Note  that  exactly  at  the  hf  point,  A{q)  =  0,  the  third  term  in  (10)  drops  out  and  (1 1)  is 
exact. 

Equation  (11)  along  with  (II)  can  easily  be  reduced  to  the  rpa  eigenvalue  equation, 

R|*hp>  =«,(«)  I  *hp>,  (12) 

where  co(q)  =  C(q)/m(q)  and  R  is  the  rpa  matrix.  We  thus  conclude  that  at  the  hf  point 
the  solutions  are  the  rpa  modes  and  the  lowest  energy  solution  is  the  lowest  rpa  mode. 
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However,  due  to  the  curvature  effect,  this  does  not  preclude  the  possibility  of  many 
solutions  in  the  neighbourhood  of  the  hf  point  terminating  at  that  point  with  the  slope 
of  the  pure  lowest  rpa  mode. 

The  curvature  term  (dQ/dq)  in  (10)  cannot  be  neglected  in  the  neighbourhood  of  the 
hf  point  as  can  be  seen  from  the  second-order  equation  (III),  which  gives  (Mukherjee 
and  Pal  1982a) 

<  Wq) \d~  | <■>(<?)  >  =  ~  < mq) | [ [W,  iQ\ iQ]  -  ^  Q \ <!>(<?)  >.  (13) 

It  therefore  follows  that  although  the  solution  of  the  atdhf  equations  in  the  limit 
(q  qHF)  behaves  like  the  decoupled  rpa  solution,  in  the  neighbourhood  of  the  hf 
minimum,  the  solution  cannot  be  given  by  a  pure  rpa  mode  but  is  a  superposition  of  the 
rpa  modes  which  is  dictated  by  the  non-vanishing  curvature. 

The  above  assertion  has  been  demonstrated  in  the  context  of  the  three-level  Lipkin 
model  by  Mukherjee  and  Pal  (1981)  and  in  order  to  clarify  further  we  reproduce  here 
(figure  1)  one  of  the  figures  of  Mukherjee  and  Pal  (1981)  where  the  iowest-frequency 
rpa  mode  is  shown  as  the  line  HR  which  excepting  at  the  hf  point  is  not  an  atdhf 
solution.  In  the  atdhf  numerical  calculations  (Goeke  et  al  1980, 1983a,  b)  of  the  fusion 
of  O16  +  O16  this  ambiguity  of  starting  the  solution  at  the  hf  state  of  the  compound 
system  has  been  clearly  felt  and  the  optimal  path  is  obtained  by  trial  and  error  method 
starting  from  a  point  far  off  from  the  hf  state. 


S1  M 


Figure  1.  The  entire  topography  of  atdhf  paths  (continuous  lines)  along  with  the  contour 
plot  (dashed  lines)  of  V(01,02)  for  the  case  x  >  (n  -F  1)  (x  =  3,  n  =  1)  with  the  maximum  M, 
minimum  IJ  and  the  two  saddle  points  Sx  and  S2  of  V(61  ,d2)  are  shown.  The  optimal  path 
following  the  bottom  of  the  valley  is  shown  by  heavy  dots.  The  line  HR  (parallel  to  thetf2-axis) 
represents  the  Iowest-frequency  rpa  mode. 
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In  the  adiabatic  motion  the  nuclear  system  will  in  general  start  oscillations  in  a 
superposed  mode  and  is  expected  to  choose  an  optimal  path  along  which  the  energy  is 
minimised.  This  is  the  path  which  satisfies  the  consistency  condition  (III  ).  We  therefore 
conclude  that  instead  of  constraining  the  system  to  choose  a  pure  rpa  mode,  the  system 
should  be  given  the  freedom  to  choose  its  lowest  energy  path  which  is  obtained 
numerically  by  satisfying  the  canonicity  condition.  Constraining  the  solution  de¬ 
liberately  as  is  done  in  Yamamura  et  al  (1984)  may  be  at  best  a  good  mathematical 
exercise  with  no  utility  for  the  physics  of  the  large-amplitude  nuclear  collective  motion 
for  which  delineating  the  valley  path  is  to  be  set  as  a  physically  more  desirable  goal. 
Even  within  the  three-level  model  the  solution  generated  with  the  mathematical 
constraint  of  Yamamura  et  al  (1984)  departs  considerably  from  the  valley  path  as  seen 
from  the  figure. 

In  the  context  of  the  three-level  Lipkin  model  it  thus  follows  from  the  above 
discussions  that  even  in  the  neighbourhood  of  the  hf  point,  the  principal  lines  are  not  a 
solution  to  the  atdhf  equations  (I)  and  (II)  which  in  the  Lipkin  model  reduce  to  a 
non-linear  first-order  ordinary  differential  equation  and  thus  the  prescription  of  solving 
(I)  and  (II)  or  equivalently  the  non-linear  differential  equation  with  a  boundary 
condition  of  particular  solution  (in  the  neighbourhood  of  the  hf  point)  which  itself  is 
not  a  solution  to  the  atdhf  equations  is  not  clear  to  the  present  authors. 

In  our  view  (8)  does  not  always  imply  that  the  condition  (9)  should  be  satisfied 
because  of  the  following  reason.  The  solutions  of  Villars  equations  (I)  and  (II)  are 
obtained  by  linearising  (I)  about  the  critical  point  and  therefore  in  essence  is  equivalent 
to  (1 1)  in  which  the  curvature  has  been  neglected.  Further  the  canonicity  condition  is 
also  linearised  about  the  critical  point.  In  such  a  linearised  atdhf  theory  we  can  only 
demand  that  (8)  should  be  satisfied  only  in  the  limit  ({  — ►  0).  If  the  limit  ({  -►  0)  of  (8)  is 
utilised  two  situations  arise: 

(i)  If  p  >  0  characterising  the  nodal  point  then  in  the  limit  (£  -►  0)  (8)  is  satisfied  for 
all  values  of  A  and  B  including  those  of  (9)  and  thus  the  principal  line  is  not  the  only 
solution  that  satisfies  the  canonicity  condition  (8)  but  a  whole  host  of  solutions  of 
Villars  equations  along  with  the  principal  line  satisfies  the  canonicity  condition  and 
behave  like  the  lowest-frequency  rpa  mode  near  the  hf  minimum. 

(ii)  If  p  <  0  characterising  the  saddle  point  then  in  the  limit  ({  ->  0)  (8)  implies  that  (9) 
must  be  satisfied  and  thus  the  principal  line  is  the  only  solution. 

Thus  the  assertion  of  Yamamura  et  al  (1984)  that  in  the  neighbourhood  of  a  critical 
point  the  only  solution  of  Villars  equations  that  satisfies  the  canonicity  condition  (6) 
and  behave  like  the  lowest  frequency  rpa  mode  is  the  lowest-frequency  principal  line  is 
true  only  when  the  critical  point  is  a  saddle  point  but  not  a  nodal  point  and  therefore  the 
results  established  in  our  work  (Mukherjee  and  Pal  1981,  1982a,  b)  contrary  to  the 
criticism  of  Yamamura  et  al  (1984)  are  correct  and  valid. 


4.  Conclusions 

In  this  paper  the  non-uniqueness  of  the  solution  of  Villars  equations  satisfying  the 
canonicity  equation  and  the  rpa  boundary  condition,  in  contrast  to  the  criticism  due  to 
Yamamura  et  al  (1984)  has  been  established.  The  adiabatic  time-dependent  Hartree- 
Fock  theory  with  the  consistency  condition  which  delineates  in  a  consistent  manner  the 
‘fission  path’  desired  by  workers  in  this  field  has  been  summarised. 
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Until  now  the  application  of  the  atdhf  theory  to  realistic  calculations  has  been 
limited  to  the  fusion  of  two  heavy-ions;  the  work  has  been  carried  out  by  Goeke  et  al 
(1980,  1983a,  b).  However  the  atdhf  theory  is  most  suitable  for  describing  nuclear 
fission  as  is  clear  from  the  text.  So  far  it  has  not  been  applied  to  nuclear  fission.  Though 
theoretical  investigations  are  required  to  incorporate  some  of  the  relevant  features  of 
fission,  the  numerical  application  of  the  theory  to  nuclear  fission  is  the  challenging 
problem  to  nuclear  theorists  in  recent  times. 
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Nucleon  transport  processes  in  fission  and  heavy  ion  reactions 

S  S  KAPOOR  and  V  S  RAMAMURTHY 

Bhabha  Atomic  Research  Centre,  Bombay  400085,  India 

Abstract.  The  nucleon  exchange  process  between  two  nuclei  in  close  proximity  and  its 
application  to  an  explanation  of  fragment  mass  and  charge  distributions  in  fission  and  in  heavy 
ion  deep  inelastic  collisions  are  reviewed.  An  analysis  of  the  measured  correlations  between  the 
energy  loss  from  relative  motion  and  the  fragment  mass  and  charge  variances  in  the  heavy  ion 
deep  inelastic  collisions  is  presented.  The  recent  data  on  fragment  mass  and  charge  variances  as 
a  function  of  the  fragment  kinetic  energy  in  thermal  neutron  induced  fission  of  235U,  lends 
added  support  to  the  hypothesis  that  the  nucleon  transport  process  plays  a  similar  role  both  in 
fission  and  in  heavy  ion  deep  inelastic  collisions. 

Keywords.  Nucleon  transport  process;  fission;  heavy  ion  reaction;  stochastic  theory;  nucleus- 
nucleus  collision. 

PACS  No.  25  70;  25  85 


1.  Introduction 

Nucleon  exchange  between  the  two  nascent  fragments  formed  during  the  fission 
process  near  the  scission  point  as  the  mechanism  leading  to  the  observed  m$ss 
distributions  in  fission  was  first  proposed  by  Ramanna  (1964)  nearly  two  decades  ago. 
More  detailed  models  (Ramanna  et  al  1965;  Ramanna  and  Ramamurthy  1969; 
Ramamurthy  1971;  Prakash  et  al  1980)  based  on  the  above  nucleon  exchange 
mechanism  were  subsequently  developed  to  satisfactorily  explain  the  general  features 
of  the  fragment  mass  and  charge  distributions  in  fission.  In  recent  years,  with  the 
availability  of  a  variety  of  accelerated  heavy  ion  beams  to  study  heavy  ion-induced 
nuclear  reactions,  the  nucleon  exchange  process  taking  place  between  two  nuclei  in 
close  proximity  has  found  additional  experimental  support  from  the  study  of  deep 
inelastic  heavy  ion  collisions.  In  fact,  studies  of  nucleus-nucleus  collisions  at  medium 
energies  have  now  made  possible  investigations  of  the  nucleon  transport  processes  in  a 
systematic  and  almost  controlled  manner.  During  the  last  decade,  the  large  body  of 
experimental  data  on  the  conversion  of  the  relative  kinetic  energy  into  fragment 
excitation  energies  (commonly  referred  to  as  energy  dissipation  or  energy  loss),  transfer 
of  orbital  angular  momentum  to  the  fragment  spins  and  nucleon  transfers  between  the 
target  and  the  projectile  in  a  nucleus-nucleus  collision  have  been  analysed  in  terms  of 
the  relevant  transport  coefficients  which  have  found  explanation  in  terms  of  statistical 
exchange  of  nucleons  between  the  target  and  the  projectile  (see  reviews  by  Schroder  and 
Huizenga  1977;  Lefort  and  Ngo  1978;  Weiden  Muller  1980;  Gobbi  and  Norenberg 
1980;  Kapoor  1982  and  references  cited  therein).  These  studies  have  also  established 
that  a  significant  number  of  nucleon  exchanges  take  place  in  the  very  short  interaction 
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times  of  few  times  10“ 21  sec  involved  in  the  deep  inelastic  collisions.  In  this  work,  we 
present  a  brief  review  of  the  nucleon  exchange  process  operating  between  two  nuclei  in 
proximity,  and  its  application  to  the  fission  process  and  heavy  ion  reactions. 


2.  Stochastic  theory  of  fragment  mass  and  charge  distributions  in  fission 


The  application  of  the  nucleon  exchange  mechanism  for  an  explanation  of  the  fragment 
mass  and  charge  distributions  in  fission  is  based  on  the  fact  that  during  the  last  stages  of 
the  fission  process,  the  fissioning  nucleus  can  be  well  approximated  by  two  weakly 
interacting  nascent  fragments.  If  one  assumes  that  the  relative  motion  of  the  nascent 
fragments  is  slow  compared  to  the  characteristic  time  for  a  nucleon  transfer,  the  process 
can  be  treated  as  a  stochastic  process  where  not  only  the  nascent  fragments  are  in 
thermodynamic  equilibrium  within  themselves  but  they  also  attain  mutual  equilibrium 
among  themselves  with  respect  to  nucleon  and  energy  transfers.  The  observed 
distributions  of  the  fragment  mass  and  charge  then  correspond  to  the  equilibrium 
distribution  near  the  scission  point.  Ramanna  and  coworkers  (Ramanna  et  al  1965; 
Ramanna  and  Ramamurthy  1969)  have  given  the  theoretical  formulation  of  the 
nucleon  exchange  process  between  the  two  nascent  fragments  for  the  calculation  of  the 
equilibrium  mass  and  charge  distributions.  Considering  for  the  sake  of  illustration  a 
one-component  system,  the  configuration  of  the  fissioning  nucleus  at  any  instant  near 
the  scission  point  can  be  defined  by  specifying  the  number  of  nucleons  on  either  side. 
Let  coM  be  the  probability  that  the  fissioning  nucleus  has  a  configuration  with  M 
nucleons  in  the  heavy  side,  the  number  of  nucleons  in  the  light  side  being  determined 
from  the  total  nucleon  number  conservation.  If  PM  M  denotes  the  probability  that  a 
configuration  with  M '  nucleons  in  the  heavy  side  goes  over  to  the  configuration  with  M 
nucleons  in  the  same  side  in  a  small  interval  of  time  At,  one  has 


wM(r  +  A<)  = 

M' 


By  definition 


1  and  X>«=1 

M  M 

Under  the  condition  of  complete  equilibrium  in  the  mass  asymmetry  degree  of 
freedom,  one  has  wM  ( t  +  At)  =  wM  (t).  If  the  unit  of  time  At  is  sufficiently  small  and  one 
can  neglect  cluster  transfers 


FM  M  =  0,  M^M'orM'±l 


It  has  been  shown  that  under  these  conditions  the  probabilities  wM  follow  the  simple 
relation 


w 


M+  1 


/WM  —  P  M,M+\/ P 


M+\,M 


Thus  the  ratio  of  the  probabilities  for  adjacent  configurations  M  and  M  - 1-1  are 
simply  equal  to  the  ratio  of  the  nucleon  transfer  probabilities  in  the  directions  M  -►  M 
+  1  and  M  +  1  -►  M.  A  generalization  of  (1)  in  two  dimensions  to  consider  both  proton 
and  neutron  transfers,  gives 

wjv,z  T  ^0  =  X  Lwn',z’  (O-Pjv'z'./vz 

N'  Z' 


(2) 
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The  steady  state  solution  of  w  is  given  by 

W  N,Z  ~  N',Z'P  N'Z',NZ  (3) 

N'  T 

The  probabilities  w  and  P  are  also  subject  to  the  constraints 

II>N,Z=1  '  (4) 

N  Z 

X  X  Pn'Z',NZ  ~  1  (5) 

N  Z 

Assuming  single-nucleon  transfers,  one  gets 

Pnz,nz  =  0  NfiN\N'±  1  (6) 

ZfiZ\Z'±  1 

Equation  (3)  can  be  solved  numerically  to  obtain  the  probability  distribution  w, 
provided  the  transition  probabilities  are  known.  It  is  seen  from  the  above  equations 
that  the  central  quantities  which  decide  the  mass  and  the  charge  distributions  are  the 
transition  probabilities  PM  M  and  PN  Z  Nz  which  are  in  turn  given  by  the  single-nucleon 
transfer  probabilities  from  one  of  the  fragments  to  the  other. 

Figure  1  illustrates  the  mechanism  of  nucleon  transfers  between  the  two  nascent 
fragments.  If  the  fragments  are  cold,  the  direction  of  spontaneous  transfer  of  nucleons 
is  from  the  fragment  having  the  higher  Fermi  energy  to  the  one  having  lower  Fermi 
energy.  Even  if  the  nascent  fragments  have  some  excitation  energy,  a  tendency  for  a 
preferential  transfer  of  nucleons  in  the  direction  of  decreasing  chemical  potential 


L  H 


Figure  1.  Schematic  diagram  illustrating  the  nucleon  exchange  mechanism  between  the 
fragments  L  and  H.  pLjH(r)  represents  the  diffuse  densities,  gL/H  are  the  single  particle  level 
densities,  fLH  are  the  Fermi-Dirac  occupation  probabilities  and  pL/H  are  the  chemical 
potentials  of  the  fragments  L  and  H  respectively. 
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persists.  A  quantitative  estimate  of  the  relative  probabilities  of  nucleon  transfers  in  both 
directions  can  be  made  as  (Prakash  et  al  1980;  Ramanna  and  Ramamurthy  1969) 


M,M+  1 


2n 

Y 


\gL{E)fL(E)gH(E)[_  1  -/„(£)]  \MLH\2dE 


P 


M,M— 1 


gH(E)fH(E)gL(E)[\  -/,(£)] 


Here  gL(E )  and  gH  (E)  are  the  single  particle  energy  level  densities  and  fL ( E )  and  fH  (E) 
are  the  Fermi-Dirac  occupation  probabilities  in  the  light  and  the  heavy  nascent 
fragment  respectively.  The  matrix  elements  of  transfer  M LH  and  MHL  are  equal  because 
of  microscopic  reversibility.  Since  the  main  contribution  to  the  integral  comes  from  a 
small  energy  band  around  the  chemical  potentials  of  the  two  nascent  fragments,  the 
quantities  gL,  gH  and  the  matrix  element  of  transfer  can  be  calculated  at  the  mean 
chemical  potential  and  taken  out  of  the  integral.  One  then  has 


2  n 


M,M  +  1 


M\2gL(p)gH(p)I 


LH 


where 

ILH  =  !/.(£)[!  -/„(£)]d£  (10) 

and  Ji  is  the  mean  chemical  potential.  A  similar  expression  for  ?M  M_,  can  also  be 
written.  For  a  nearly  degenerate  system  like  a  nucleus  having  a  temperature  much  less 
than  the  mean  chemical  potential,  one  gets 


j  (^h  l ) 

LH  e^H-Hi)/T- 1 


(11) 


where  pH  and  pL  are  the  chemical  potentials  of  the  two  nascent  fragments.  Thus  the 
main  driving  force  for  the  net  transfer  of  nucleons  from  one  of  the  fragments  to  the 
other  is  the  difference  in  their  chemical  potentials  though  a  finite  temperature  induces 
nucleon  transfers  in  both  directions. 

In  the  limit  of  zero  temperature,  the  difference  in  the  chemical  potentials  is  equal  to 
the  negative  of  the  difference  in  the  nucleon  separation  energies  of  the  two  nascent 
fragments,  neglecting  the  small  rearrangement  energies.  For  finite  temperatures,  one 
can  include  a  temperature  dependence  of  the  chemical  potential  in  such  a  way  that  the 
shell  effects  on  the  chemical  potential  vanish  at  high  temperatures.  Figure  2  shows  a 
typical  plot  of  the  difference  in  the  chemical  potentials  versus  the  mass  ratio  for  the  case 
of  236U  fission  (Prakash  et  al  1980).  The  fact  that  the  difference  (pH  —  gL)is  negative  for 
all  configurations  upto  neutron  number  N  =  86  and  is  positive  for  all  configurations 
above  N  =  86  is  responsible  for  the  increasing  yields  of  the  heavy  fragments  upto  N 
=  86  and  a  decrease  in  the  yield  thereafter.  Figure  3  shows  typical  mass  distributions 
for  the  fission  of  226Ra,  252Cf  and  256Fm  as  calculated  from  this  model.  It  can  be  seen 
that  the  calculations  reproduce  the  known  qualitative  features  for  all  cases  and  even 
such  details  as  the  triple  hump  for  226Ra  fission  are  reproduced. 

One  of  the  early  criticisms  of  the  nucleon  exchange  model  for  explaining  the 
fragment  mass  and  charge  distributions  in  fission  was  whether  sufficient  time  is 
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Figure  2.  Difference  in  the  chemical  potentials  of  the  light  and  heavy  fragments  as  a 
function  of  the  neutron  number  of  the  heavy  fragment.  The  individual  separation  energies  are 
calculated  for  the  most  probable  deformations  of  the  light  and  heavy  fragments.  Results  are 
shown  for  mean  fragment  temperatures  T  =  0-5,  10  and  2-0  MeV. 


available  for  an  appreciable  number  of  nucleon  transfers  to  take  place  during  the 
descent  from  the  saddle  to  the  scission  point.  Direct  experimental  evidence  for 
substantial  nucleon  exchange  between  two  nuclei  in  close  proximity  in  interaction  times 
of  the  order  of  10“  21  sec  has  now  come  from  the  studies  of  heavy  ion  deep  inelastic 
collisions. 


3.  Nucleon  exchange  processes  in  nucleus-nucleus  collisions 

It  is  now  well  known  that  in  reactions  induced  by  heavy  ions,  a  sizable  cross-section 
appears  in  a  new  process  called  deep  inelastic  collisions.  These  reactions  are 
characterised  by  a  rather  short  interaction  time  of  few  times  10" 21  sec  during  which  a 
large  fraction  of  the  kinetic  energy  of  the  colliding  nuclei  gets  converted  into  intrinsic 
fragment  excitation  energies.  In  addition,  a  large  number  of  nucleons  are  also 
exchanged  between  the  target  and  the  projectile  as  inferred  from  the  observed  width  of 
the  mass  and  charge  distributions  of  the  target-like  and  the  projectile-like  binary 
fragments  emitted  as  the  reaction  products.  An  important  observation  is  the  presence  of 
strong  correlation  between  various  quantities  such  as  the  kinetic  energy  loss  from 
relative  motion,  fragment  mass  and  charge  widths  and  the  angular  momentum  transfer 
(see  for  example,  Wollersheim  et  al  1982;  Dyer  et  al  1980;  Dakowski  et  al  1982).  A 
nucleon  exchange  mechanism  similar  to  the  one  proposed  earlier  for  an  explanation  of 
fragment  mass  and  charge  distributions  in  fission  has  been  found  to  be  useful  in 
bringing  out  the  main  features  of  the  deep  inelastic  collision  process.  One  difference 
between  the  earlier  model  for  fission  and  that  for  deep  inelastic  collisions  comes  from 
the  effect  of  the  relative  motion  of  the  two  ions  exchanging  the  nucleons.  In  addition, 
unlike  the  case  of  fission,  in  deep  inelastic  collisions  the  number  of  nucleon  transfers  are 
not  sufficient  to  attain  complete  equilibration  in  the  nucleon  exchange  degree  of 
freedom.  Under  these  conditions,  it  is  advantageous  to  define  the  nucleon  drift  and  the 
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Figure  3.  Calculated  fragment  mass  yields  for  the  fission  of  226Ra,  252Cf  and  256Fm  with  a 
mean  fragment  temperature  of  T  =  05  MeV. 


diffusion  coefficients  within  the  Fokker-Planck  approximation  (Prakash  et  al  1981). 
Theoretical  formulations  of  the  nucleon  exchange  process  between  two  ions  in  relative 
motion  and  the  resulting  transport  coefficients  can  be  found  in  the  recent  literature 
pertaining  to  the  heavy-ion  deep  inelastic  collisions. 

Considering  the  relative  motion  between  the  colliding  nuclei,  the  fermion  nature  of 
the  exchanged  particles  and  the  associated  Pauli  blocking  effect,  it  has  been  shown 
(Randrup  1977)  that  the  diffusion  of  nucleons  between  the  two  nuclei  results  in  a  time 
variation  of  the  width  of  the  fragment  mass  distribution  as 

dajdt  =  NXwcoth— ;>f  (12) 

where  N'A  is  the  differential  total  particle  current,  w  is  the  change  in  excitation  energy 
associated  with  the  transfer  of  a  nucleon,  Tis  the  temperature  of  the  system.  The  suffix  F 
denotes  that  the  average  is  taken  around  the  mean  value  of  the  Fermi  energies  of  the 
two  nuclei.  It  is  easy  to  see  that  the  quantity  w,  denoting  the  change  in  excitation  energy 
on  account  of  transfer  of  a  nucleon  from  one  nucleus  to  the  other  having  a  relative 
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velocity  v  is  given  by 

w  =  FA~  P-v 

where  FA  is  the  difference  in  the  Fermi  energies  and  p  is  the  momentum  of  the 
transferred  nucleon. 

The  rate  of  the  resulting  energy  dissipation  is  given  by 
dE 

"d J  =  N*<w>f  03) 


In  most  cases  of  the  deep  inelastic  heavy  ion  collisions,  it  is  a  good  approximation  to 
take 

coth  ( <w2  )f)l/2T  se  1 
and  (12)  is  then  often  approximated  as 

do2A/dt  *  N^(<w2)f)l  (14) 

From  (13)  and  (14)  it  then  follows  that 

—  dF/dcr2  =  (<w2)f)i  (15) 

For  peripheral  collisions,  where  most  of  the  nucleon  transfers  take  place,  (15)  reduces  to 


—  dE/doj 


m 


(x  —E 
E 


(16) 


where  m  is  the  nucleon  mass,  n  is  the  reduced  mass  of  the  di-nuclear  complex  and  EF  is 
the  average  of  the  Fermi  kinetic  energies  of  the  two  nuclei.  E  is  the  kinetic  energy  in 
relative  motion  above  the  Coulomb  barrier  Vc  given  by 

E  =  E  ~  V  (R  )-  F 

cm  £,]oss 

Rint  refers  to  the  interaction  distance  between  the  colliding  ions,  and  £loss  is  the  energy 
loss  from  the  relative  motion.  In  a  slightly  different  approximation,  one  obtains 

da2Jdt  as  N'a  <  |  w  |  )F 

which  when  flux-averaged  gives  an  additional  factor  of  3n/&  over  that  of  (16).  Although 
we  base  the  following  discussion  on  the  use  of  (16),  the  main  conclusions  would  remain 
unaltered  if  the  factor  3rc/8  is  included.  For  the  classical  case  of  exchanged  particles 
being  motionless  in  the  individual  nuclear  containers,  a  will  be  unity.  Thus,  the  value  of 


in  (1 6)  results  from  the  consideration  of  the  Fermi  nature  of  the  exchanged  particles  and 
the  Pauli  blocking  effect. 

On  integrating  (16),  one  gets 

£1/2  «  ££/2-^^£f^  <t2  (17) 

The  Fermi  energy  EF  is  a  reasonably  well-known  quantity  ( ~  37  MeV),  and  thus  the 
above  predictions  of  the  nucleon  exchange  model  do  not  involve  any  free  parameter. 
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Before  we  discuss  the  comparison  of  the  experimental  results  with  the  predictions  of 
( 1 7)  some  additional  facts  must  be  pointed  out.  In  most  experiments  the  quantity  which 
is  measured  is  <r|,  the  variance  of  the  fragment  charge  distribution  and  not  o\. 
Conversion  of  g  2  into  g\  requires  a  knowledge  of  the  neutron-proton  correlation  in  the 
exchange  process.  For  uncorrelated  transfers  g2  —  (A/Z)g2  while  for  fully  correlated 
transfers  o2  —  (A/Z)2g2z.  Thus,  in  some  earlier  studies  the  comparison  of  the 
correlation  between  the  measured  g\  and  the  energy  loss  with  the  predictions  of  the 
above  transport  model  involved  some  ambiguity  as  a  result  of  uncertainty  in  the 
conversion  of  g\  into  g2  representing  the  total  number  of  nucleon  exchanges.  Further 
studies  of  this  problem  carried  out  at  Trombay  (Kapoor  and  De  1982;  De  and  Kapoor 
1983)  have  clarified  that  it  is  possible  to  make  comparisons  of  experimental  results  on 
g\  with  the  theoretical  predictions  of  the  transport  model  without  much  ambiguity 
arising  from  the  degree  of  neutron-proton  correlation.  It  was  pointed  out  in  this  work 
that  if  there  is  a  neutron-proton  correlation  in  the  exchange  process,  not  only  should 
the  experimental  g\  be  suitably  converted  into  g2  but  one  must  also  consider  that  the 
theoretical  predictions  of  the  transport  model  also  get  modified.  It  was  shown  (Kapoor 
and  De  1982;  De  and  Kapoor  1983)  that  while  in  the  case  of  uncorrelated  neutron- 
proton  transfers,  the  variance  of  the  fragment  mass  distribution  g2  is  equal  to  the  total 
number  of  nucleon  transfers,  for  fully  correlated  motion,  the  value  of  g2  is  larger  than 
the  number  of  nucleon  transfers  by  a  factor  A2/2ZN.  If  one  considers  the  correlated 
and  uncorrelated  motion  for  a  given  number  of  particle  exchanges,  the  rate  of  energy 
dissipation  remains  unaltered.  It  then  follows  that  the  theoretical  value  of  dE/dG2  for 
correlated  transfer  is  related  to  the  expression  for  uncorrelated  transfers  as 

[dE/da*  ]corr  =  [dE/da2A  ]  unco[r  ( 1 8) 

The  experimental  results  on  the  variance  of  the  fragment  charge  distributions  versus 
energy  loss  for  several  heavy  ion  reactions  were  analysed  earlier  (Kapoor  and  De  1982; 
De  and  Kapoor  1983;  Kapoor  1982a)  by  including  the  effect  of  correlations  in  the 
transport  model  in  a  manner  which  is  consistent  with  the  assumption  made  in  the 
transformation  of  g\  into  experimental  g2.  Figures  4  and  5  show  the  comparisons  with 
the  experimental  results  for  cases  of  heavy  ion  reactions  of  Xe  +  Bi,  Pb  -f  U,  and  Pb 
T  Pb.  The  dashed  curve  in  figure  4  represents  the  theoretical  curve  if  one  is  not 
consistent  in  applying  the  effect  of  correlation  in  neutron-proton  exchanges  in  both  the 
transport  model,  and  in  the  relationship  between  <rjand  experimental  g2.  The  good  fits 
of  the  experimental  points  to  the  solid  lines  in  the  figures  show  satisfactory  agreement 
with  model  predictions,  nearly  independent  of  the  assumptions  made  about  neutron- 
proton  correlations  in  the  exchange  process,  bringing  out  unambiguously  the  dominant 
role  of  nucleon-exchange  process  in  the  energy  loss. 

The  influence  of  the  neutron-proton  correlation  in  the  evolution  of  the  fragment 
mass  and  charge  distributions  in  fission  has  been  studied  recently  at  Trombay  (Rekha 
Govil  et  al  1983)  Making  use  of  a  back-to-back  A E  —  E  detector  arrangement,  the 
correlation  between  the  variances  in  the  fragment  mass  and  charge  distributions  versus 
the  total  fragment  kinetic  energy  was  studied  in  the  case  of  thermal  neutron  induced 
fission  of  235 U.  Figure  6  shows  the  measured  correlations  and  shows  a  rather  close 
resemblance  to  the  known  systematics  (Breuer  et  al  1979)  in  the  case  of  heavy  ion  deep 
inelastic  collisions,  namely,  the  ratio  g2/g2  is  close  to  (T/Z)2  for  low  kinetic  energies 
and  approaches  (A/Z)  for  higher  kinetic  energies.  This  similarity  lends  support  to  the 
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Figure  4.  Plots  of  £1/2  versus  o2z  for  three  bombarding  energies  of  136Xe  on  209Bi. 

hypothesis  that  the  final  mass  and  charge  distributions  in  fission  as  well  as  in  dic  are 
governed  by  similar  processes,  namely,  a  nuclear  exchange  process. 

There  has  also  been  some  discussion  in  the  earlier  work  on  the  question  of  the  nuclear 
shell  effects  on  the  nucleon  exchange  process.  In  the  analysis  of  the  heavy  ion  deep 
inelastic  collision  data  with  ( 1 7),  the  single  particle  states  of  the  nucleons  are  assumed  to 
be  given  by  a  smooth  distribution,  and  therefore  nuclear  shell  effects  are  neglected. 
Good  fits  obtained  above  for  the  cases  of  Bi  +  Xe  and  Pb  +  U  do  indicate  that  the  shell 
effects  have  little  influence  on  the  above  description  which  may  partly  be  due  to  the 
washing  out  of  the  shell  effects  at  finite  temperatures  reached  during  the  nucleon 
exchange  processs.  However,  for  the  case  of  Pb  +  Pb,  where  both  the  colliding  nuclei 
have  doubly  closed  shells  and  where  we  are  dealing  with  nuclei  which  have  highest  shell 
correction  energy,  noticeable  deviations  from  the  predictions  of  the  Fermi  gas  model 
are  apparent  in  figure  5.  Although  bulk  of  the  energy  loss  in  heavy  ion  collisions 
originates  from  the  nucleon  exchange  process,  it  is  also  to  be  expected  that  some  energy 
loss  arises  from  other  competing  mechanisms.  One  effect  of  the  energy  gap  at  the 
Fermi-surface,  that  is,  of  shell  effects,  would  be  to  slow  down  the  nucleon  exchange 
process,  on  account  of  a  smaller  density  of  states  at  the  Fermi  surface.  This  could  then 
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Figure  5.  Plots  of  Ex/2  versus  f°r  7-OMeV/u  Pb  +  Pb,  7-6MeV/u  Pb  +  Pb  and 
7-5  MeV/u  Pb  +  U. 

relatively  enhance  the  importance  of  other  mechanisms  competing  with  the  nucleon 
exchange  process,  resulting  in  deviations  of  the  theoretical  predictions  from  the 
experimental  results.  However,  this  question  of  nuclear  shell  effects  in  the  nucleon 
exchange  process  needs  to  be  further  investigated. 

Thus,  the  main  conclusions  resulting  from  the  above  discussions  are  as  follows: 

When  the  effect  of  isospin  correlations  in  the  exchange  process  is  considered  both  in 
the  conversion  of  measured  a\ into  experimental  o24,  and  also  in  the  transport  model,  it 
turns  out  that  comparison  of  c\  versus  energy  loss  with  the  model  is  almost 
independent  of  the  degree  of  the  isospin  correlation  in  the  exchange  process.  Thus  such 
a  comparison  focusses  on  the  magnitude  of  the  energy  loss  arising  from  the  exchange 
process  without  much  ambiguity  due  to  the  presence  of  isospin  correlation.  The 
analysis  of  a  number  of  heavy-ion  systems,  supports  the  above  conclusion  and  also 
brings  out  in  an  unambiguous  way  that  the  observed  energy  loss  can  be  accounted 
primarily  by  the  nucleon  exchange  mechanism  alone  without  having  to  invoke  any 
arbitrary  parameter.  For  most  of  the  systems  including  Xe  +  Bi  and  Pb  +  U  where  the 
target  or  the  projectile  nuclei  have  significant  negative  shell  correlation  energies,  there  is 
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Figure  6.  Kinetic  energy  dependence  of  the  fragment  charge  variance  c\  (a),  mass  variance 
aA(b)  and  the  ratio  o2A/oz  (c)-  The  dotted  horizontal  lines  are  values  of  (A/Z)2  and  (A/Z)  for 
the  236U  compound  nucleus. 


no  evidence  for  significant  shell  effects  in  the  observed  correlations  between  the  energy 
loss  and  o\.  However,  for  Pb  +  Pb  system,  some  deviations  from  the  predictions  of  the 
model  are  observed  which  need  further  investigations. 

In  the  above  discussion,  we  have  focussed  on  the  expected  correlation  between 
energy  loss  and  the  variance  of  the  fragment  mass  and  charge  distributions  in  heavy  ion 
deep  inelastic  collisions.  But  there  are  other  features  of  the  heavy  ion  collisions  which 
are  also  governed  by  this  basic  mechanism.  The  transfer  of  the  orbital  angular 
momentum  to  the  fragment  spins,  and  the  misalignment  of  the  transferred  spins  with 
respect  to  the  reaction  plane  have  also  been  discussed  in  the  literature  (Randrup  1983) 
on  the  basis  of  the  nucleon  exchange  mechanism.  In  a  recent  study  (Ramamurthy  and 
Kapoor  1984)  aimed  at  understanding  the  observed  anomalous  fragment  angular 
distributions  in  heavy  ion  induced  fusion-fission,  where  the  relaxation  of  the  dinuclear 
complex  in  the  mass-asymmetric  degree  of  freedom  is  reached  during  fusion,  the 
analysis  of  the  data  brings  out  that  such  a  complex  relaxes  in  the  K-degree  of  freedom 
on  a  time  scale  of  several  times  1CT21  sec,  and  the  fused  composite  nuclei  with  fission 
barriers  comparable  to  temperature  can  fission  from  the  non-equilibrium  state  of 
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unrelaxed  K-distribution.  Presence  of  this  type  of  non-equilibrium  fission  is  shown  to 
be  responsible  for  the  apparently  anomalous  fragment  angular  distributions.  The 
importance  of  the  role  of  nucleon-exchange  mechanism  in  the  process  of  K- 
equilibration  has  been  recognized,  but  a  quantitative  study  is  needed  to  arrive  at  more 
definite  conclusions.  There  are  several  other  aspects  of  nucleon  exchange  mechanism  in 
fission  and  heavy-ion  reactions  such  as  the  effects  of  cluster  transfers  which  are  also 
currently  being  explored. 

In  conclusion,  the  nucleon  exchange  mechanism  operating  between  two  nuclei  in 
proximity  first  suggested  for  the  fission  process  has  now  been  extensively  studied 
through  the  deep  inelastic  heavy  ion  collisions.  The  experimentally  well-established 
correlation  between  the  dissipated  energy  and  the  variance  of  the  fragment  charge  or 
mass  distribution  in  the  deep  inelastic  heavy  ion  collisions,  and  the  transport  model 
description  taking  into  account  neutron-proton  correlations  in  the  exchange  process, 
bring  out  in  an  unambiguous  way  the  dominant  role  of  the  nucleon  exchange  process  in 
the  energy  loss  mechanism  in  heavy  ion  reactions.  It  is  also  recognized  that  several  other 
observed  features  in  heavy  ion  reactions  are  also  governed  by  this  underlying 
mechanism. 
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Angular  distribution  in  ternary  fission 

M  M  SHARMA  and  G  K  MEHTA 

Department  of  Physics,  Indian  Institute  of  Technology,  Kanpur  208016,  India 

Abstract.  The  angular  distribution  of  long-range  alpha  particles  emitted  in  keV-neutron 
induced  fission  of  235U  has  been  measured  using  a  technique  which  employs  only  a  particle 
telescope  to  derive  the  angular  information.  The  neutron  energy  region  investigated  is 
100  keV-1  MeV.  The  angular  distribution  of  lras  has  been  found  to  be  peaked  perpendicular 
to  the  neutron-direction  with  a  substantial  amount  of  anisotropy  near  200  keV. 

Keywords.  Fission;  LRA-fission;  angular  distribution. 

PACS  No.  25  85 


1.  Introduction 

Light-charged  particle  (lcp)  accompanied  fission  represents  an  interesting  phenom¬ 
enon  in  the  study  of  the  fission  process.  It  is  a  potential  source  of  information  on  the 
configuration  of  the  fissioning  nucleus  at  scission  and  the  subsequent  dynamics  of  the 
process  till  the  last  stage  of  rupture.  The  striking  feature  of  the  LCP-emission  is  that  their 
angular  distribution  is  strongly  peaked  perpendicular  to  the  fission-fragment  direction. 
This  has  led  to  the  belief  that  they  are  born  in  the  vicinity  of  the  scission  point  from  the 
neck  of  the  compound  nucleus.  Several  hypotheses  on  the  emission  mechanism  of  the 
lcp  s  viz  the  pre-scission  evaporation  model  (Ramanna  et  al  1963),  Carjan’s  model 
(1975)  of  pre-scission  emission,  the  post-scission  emission  model  by  Feather  (1969),  the 
sudden-snap  model  (Halpern  1965)  and  the  statistical  model  (Fong  1970)  are  in  vogue. 
However,  it  is  not  yet  established  whether  the  lcps  are  emitted  before  or  after  the 
scission.  The  experiments  on  the  angular  distribution  of  the  light-charged  particles 
(lcps)  about  the  fission  axis  have  yielded  some  information  on  the  configuration  of  the 
nucleus  at  the  scission  point  with  the  help  of  trajectory  calculations.  However,  there  is 
always  an  uncertainty  in  locating  the  scission  point  in  trajectory  calculations  which 
reflects  in  the  non-uniqueness  of  the  information  derived  about  the  dynamical  variables 
at  scission.  The  angular  distribution  of  the  lcps  about  the  space-fixed  axis  (incident 
neutron  direction)  could  provide  valuable  information  on  the  emission  mechanism  of 
these  lcps.  This  aspect  of  the  lcps  is  not  studied  well  and  the  existing  experiments  have 
yielded  conflicting  results.  The  present  paper  is  an  effort  to  understand  the  lcp- 
emission  mechanism  by  studying  the  anisotropy  of  lcps  about  the  incident  projectile 
direction. 

The  angular  distribution  of  the  binary  fission  fragments  in  neutron-induced  fission 
near  the  fission  threshold  exhibits  a  forward  peaking  about  the  incident  neutron 
direction.  On  the  other  hand,  the  trend  of  the  fragment  angular  distribution  in  ternary 
fission  (fission  accompanied  by  a  third  particle  (lcp))  is  unclear.  The  behaviour  of  the 
lcp  angular  distribution  should  depend  on  the  fact  whether  the  lcp  emission  takes  place 
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before  or  after  the  scission.  The  angular  distribution  of  the  fragments  and  the  lras 
(long-range  alpha  particles,  the  most  predominant  lcp)  in  14  MeV  neutron-induced 
fission  of  235U  was  measured  by  Ramanna  et  al  (1963).  They  observed  that  the  lra 
angular  distribution  was  peaked  fore  and  aft  along  the  beam  direction,  and  the 
associated  fragments  (referred  to  as  ternary  fragments)  were  peaked  at  90°  to  the  beam. 
This  behaviour  of  ternary  fragments  was  opposite  to  the  behaviour  of  fragments  in 
binary  fission.  This  result  implied  that  the  ternary  fission  process  is  distinctly  different 
from  the  binary  process  notwithstanding  that  all  the  other  characteristics  of  the  fission 

process,  such  as  kinetic  energy  distribution,  mass  distribution  etc  are  known  to  be  very 
similar  in  the  two  cases  of  the  binary  and  ternary  fission.  However,  in  another 
measurement,  1 7*5  MeV  proton-induced  fission  of  238U,  Atneosen  et  al  (1965) 
observed  that  the  ternary  fission  fragments  have  nearly  the  same  (0°-peaked)  angular 
distribution  as  the  binary  fission  fragments.  The  results  of  Ramanna  et  al  (1963)  and 
Atneosen  et  al  (1965)  are  contradictory  to  each  other.  In  both  these  experiments  there  is 
substantial  contribution  from  second  and  third  chance  fissions,  and  the  observed 
angular  distribution  arises  from  contributions  due  to  a  number  of  nuclides  fissioning  at 
different  excitation  energies.  Nadkarni  (1968)  measured  the  angular  distribution  of 
ternary  fission  fragments  and  lras  about  the  neutron-beam  direction  in  3  MeV 
neutron-induced  fission  of  235U.  The  ternary-fragment  anisotropy  was  found  to  be 
opposite  to  that  of  binary  fragments,  in  agreement  with  the  results  of  Ramanna  et  al 
(1963).  The  results  were  interpreted  by  assuming  that  the  lra  emission  takes  place  as  a 
result  of  evaporation  before  the  scission  point.  However,  it  was  not  understood  as  to 
why  in  this  particular  characteristic,  the  ternary  fragments  behave  in  a  completely 
different  way  than  the  binary  fragments  whereas  in  all  other  characteristics  they  are 
similar.  In  view  of  the  unclear  picture  of  ternary-fission  angular  distribution  as  stated 
above,  we  have  measured  the  angular  distribution  of  lras  in  the  neutron-induced 
fission  of  23 5U  for  several  energies  in  the  region  100  keV  to  1  MeV. 


2.  Experimental  technique 

The  low  probability  of  occurrence  of  the  ternary  fission,  particularly  in  fast-neutron 
induced  fission,  makes  it  difficult  to  investigate  the  angular  distribution  of  the  lra  s 

using  conventional  techniques.  We  have  developed  a  method  (Sharma  et  al  1984)  for 
determining  the  angular  distribution  using  a  AE  —  E  particle  telescope.  This  allows 

simultaneous  identification  of  the  particle  and  determination  of  its  angular  distri¬ 
bution.  The  energy  loss  in  the  A£-detector  is  proportional  to  thickness  Ax  of  the 
detector  for  the  normal  incidence.  However,  if  the  particle  is  incident  at  an  angle  6  with 
the  telexcope  axis,  it  suffers  an  energy  loss  proportional  to  the  slant  thickness  traversed 
in  the  A£-detector  and  therefore,  A£-signal  carries  an  information  on  the  angle  0. 
Using  the  range-energy  tables,  the  angular  distribution  is  extracted,  from  the  relation 

cos  0  =  Ax/Ax',  (1) 

where  Ax  and  Ax'  refer  to  the  normal  thickness  and  the  slant  distance  traversed  by  the 
particle  in  A£-detector  respectively. 
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3.  Experimental  details 

• 

The  experiment  used  a  97  %  enriched  23  5  U  source  of  thickness  5  mg/cm2  with  an  active 
area  4  cm2  and  a  semi-conductor  detector  particle  telescope  consisting  of  50  thick 
A£-detector  and  a  500  gm  thick  £-detector.  An  aluminium  foil  was  placed  between  the 
23 5 U  source  and  the  particle  telescope  to  stop  the  natural  alpha  particles  and  the  fission 
fragments  from  reaching  the  detectors.  The  space-fixed  axis  (neutron-beam  direction) 
coincided  with  the  telescope  axis.  The  distance  of  the  235U  source  from  the  neutron 
producing  target  was  such  that  the  neutron  divergence  was  %  10°.  The  neutrons  were 
produced  using  7Li(p,  n)  and  3H (p,  n)  reactions  with  2  MV  Van  de  Graaff  accelerator  at 
Indian  Institute  of  Technology,  Kanpur.  The  data  were  recorded  at  the  following 
neutron  energies:  thermal,  140  ±  30  keV,  1 70  ±  25  keV,  200  ±  25  keV,  400  +  200  keV, 
600+  180  keV  and  1000+  170  keV.  The  large  spread  at  higher  energies  was  owing  to 
the  use  of  6  Ci  tritium  target.  The  data  were  recorded  event-by-event  on  a  magnetic  tape 
using  a  3-parameter  data-acquisition  system  and  the  analysis  was  done  off-line  on 
DEC- 1090  computer. 

4.  Data  analysis  and  results 

The  added  advantage  of  our  technique  is  that  it  allows  the  use  of  broad  area  source 
which  helps  in  overcoming  the  count-rate  problem.  However,  with  broad  source  the 
contribution  to  the  angular  distribution  arises  from  every  point  of  the  source,  thereby 
causing  a  smearing  in  the  measured  angular  distribution.  The  actual  angular 
distribution  is  disentangled  from  the  experimental  one  by  Monte-Carlo  simulation 
with  the  given  geometrical  constraints.  Figure  1  shows  the  calculated  (Monte-Carlo) 
and  experimental  angular  distributions  for  thermal-neutron  induced  fission.  The 
calculated  distribution  has  been  obtained  for  isotropic  emission  in  thermal  neutron- 
induced  fission.  The  calculated  and  experimental  distributions  show  excellent  agree¬ 
ment.  The  curves  can  be  fitted  to  straight  lines  for  angles  beyond  10°.  The  points  upto 
10°  have  been  excluded  because  of  large  errors  due  to  (sin  0)~ 1  factor  near  0  =  0°  while 
calculating  drr/dQ.  The  least-square  fitted  straight  lines  to  the  calculated  and 
experimental  curves  give  identical  values  of  slopes.  The  slopes  of  the  straight  lines 
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Figure  1 .  Calculated  (Monte-Carlo)  and  experimental  angular  distributions  of  lra  s  emitted 
in  thermal-neutron  induced  fission. 
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depend  upon  the  anisotropy  of  the  angular  distribution.  Monte-Carlo  calculations  for 
various  anisotropic  distributions  were  carried  out.  The  distributions  were  assumed  to 
go  as 


W(6)  =  1  +  a  cos2  0,  (2) 

where  a  represents  the  anisotropy,  the  positive  value  indicating  a  fore  and  aft  peaking 
and  a  negative  one  showing  perpendicular  peaking.  Figure  2  shows  the  variation  of 
slopes  with  negative  values  of  a,  as  all  the  experimental  curves  in  the  fast-neutron  fission 
showed  a  decrease  in  the  slope  compared  to  thermal-neutron  fission.  This  corresponds 
to  a  negative  value  of  a  and  the  perpendicular  peaking.  For  quantitative  determination 
of  anisotropy,  figure  2  served  as  a  calibration  curve.  The  anisotropies  thus  determined 
are  shown  in  table  1.  Our  anisotropy  results  lead  to  the  conclusion  that  the  lra  angular 
distribution  is  peaked  perpendicular  to  the  neutron-beam  direction  at  all  neutron 
energies  from  140  keV  to  1  MeV  (figure  3).  Assuming  that  the  alpha-fragment  angular 


Figure  2.  Slope  of  the  least-squares  fitted  straight  lines  as  a  function  of  anisotropy  (a). 


Table  1.  Anisotropy  for  several  incident 
neutron  energies. 


Energy  of  incident  neutron 
(keV) 

Anisotropy  (a) 

(%) 

140  ±30 

(  —  85  ±28) 

170  +  25 

(  —  87  ±32) 

200  +  25 

( —  94  ±31) 

400  ±200 

( — 10  ±  28) 

600  ± 1 80 

( —  25  ±19) 

1000±  170 

(  —  50  ±27) 
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Figure  3.  Anisotropy  of  the  lras  at  different  neutron  energies. 


correlation  does  not  undergo  any  change  from  thermal  to  fast  fission,  the  ternary- 
fragment  angular  distribution  in  our  experiment  will  be  peaked  forward  and  backward 
to  neutron  direction  as  does  the  binary-fragment  angular  distribution.  We  find  that  the 
lra  anisotropy  is  very  high  near  200  keV  neutron  energy.  The  error  bars  indicated  in 
anisotropy  in  figure  3  are  due  to  statistical  origin  only  as  the  angular  resolution  due  to 
our  technique  is  only  ~  ±  3°  and  better  above  angle  1 5°. 


5.  Discussion 

The  angular  distribution  of  the  long-range  alpha  particles  (lra  s)  emitted  in  ternary 
fission  is  observed  to  be  peaked  perpendicular  to  the  incident  neutron  direction.  This 
implies  that  the  ternary  fission  fragments  are  forward-peaked  which  is  similar  to  the 
case  in  the  binary  fission.  Thus  it  is  concluded  that  the  characteristics  of  the  fission 
fragments  are  similar  in  both  binary  and  ternary  fission  in  all  aspects.  The  similarity  in 
angular  distribution  suggests  that  the  K-distributions  at  the  saddle  point  are  the  same 
whether  a  third  particle  accompanied  the  fission  or  not.  Earlier  measurements  of 
Ramanna  et  al  (1963)  and  Nadkarni  (1968)  do  not  agree  with  our  results.  However, 
these  angular-distribution  measurements  were  for  higher  energy  neutron-induced 
fission.  They  had  proposed  that  the  lra  emission  takes  place  as  evaporation  before  the 
scission  point.  Our  results  on  ternary  fission  in  the  neutron  energy  range  of  100  keV  to 
1  MeV  lend  credence  to  the  assertion  that  the  lra  emission  takes  place  near  the  scission 
point.  The  models  which  can  predict  the  angular  distributions  in  conformity  with  our 
results  are  the  sudden-snap  model  (Halpern  1965)  and  the  statistical  model  (Fong 
1970). 

We  find  that  the  anisotropy  is  very  high  near  200  keV  neutron  energy  and  then 
reduces  at  higher  energies.  In  the  region  around  200  keV,  the  interaction  of  p-wave 
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neutrons  is  predominant  and  the  states  2  + ,  3  + ,  4  +  and  5 4  are  accessible  to  the  fissioning 
nucleus  at  the  saddle  point.  The  increased  anisotropy  seems  to  be  due  to  these  positive 
parity  states  at  the  transition  point.  At  higher  neutron  energy,  other  partial  waves  also 
contribute  significantly  and  thus  one  observes  the  angular  distribution  averaged  over 
states  of  both  parities.  This  may  be  the  reason  for  decrease  in  anisotropy  at  higher 
energies. 
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Radiochemical  studies  on  fission  of  actinides 

M  V  RAMANIAH 

Bhabha  Atomic  Research  Centre,  Trombay,  Bombay  400085,  India 

Abstract.  Since  its  discovery  in  1939,  nuclear  fission  has  been  extensively  studied  by  various 
experimental  as  well  as  theoretical  groups  in  several  countries  leading  to  an  understanding  of 
major  aspects  of  this  important  and  complex  nuclear  reaction.  In  Trombay,  studies  have  been 
carried  out  in  the  last  25  years  using  both  physical  and  radiochemical  methods  and  significant 
contributions  have  been  made  towards  a  better  understanding  of  this  reaction.  This  paper 
presents  highlights  of  radiochemical  studies  on  fission  of  actinides,  particularly  mass,  kinetic 
energy  and  charge  distribution  and  fragment  angular  momentum.  Results  of  these  studies 
brought  out  the  important  role  played  by  deformation  energy  surface,  spherical  and  deformed 
nuclear  shells  and  nucleon  pairing. 

Keywords.  Nuclear  fission;  radiochemical  methods;  mass;  charge;  kinetic  energy  distri¬ 
butions;  fragment  angular  momenta;  nuclear  shells  and  nucleon  pairing. 

PACS  No.  25-85 


1.  Introduction 

The  high  sensitivity  and  specificity  of  radiochemical  methods  led  to  the  discovery  of 
nuclear  fission  by  Hahn  and  Strassmann  in  1939.  Since  then,  all  aspects  of  this  complex 
nuclear  reaction  have  been  studied  extensively  by  radiochemical  as  well  as  physical 
methods.  Soon  after  the  discovery,  Bohr  and  Wheeler  (1939)  put  forward  a  theory  of 
nuclear  fission  based  on  the  liquid  drop  model  (ldm),  which  was  able  to  explain  some 
broad  features  of  the  reaction  such  as  energetics  of  fission  process,  fission  probability 
and  fissionability.  The  most  striking  feature  of  fission,  namely,  asymmetric  mass 
distribution  could  not  be  explained  by  the  ldm,  according  to  which  the  mass 
distribution  in  low  energy  fission  should  be  symmetric.  This  problem  evoked  great 
interest  in  several  experimental  and  theoretical  groups  in  the  world.  Within  about  a 
decade  since  the  discovery  of  fission,  a  massive  amount  of  experimental  data  was 
collected  on  mass,  charge  and  kinetic  energy  distribution  as  well  as  other  aspects. 
Considerable  effort  was  directed  towards  understanding  the  variation  of  potential 
energy  surface  as  a  function  of  symmetric  and  asymmetric  deformation  for  a  wide  range 
of  fissionability  parameters  with  the  aim  of  providing  an  explanation  for  the  observed 
mass  asymmetry  (Wilets  1964).  Results  of  extensive  experimental  work  covering  mass, 
charge  and  kinetic  energy  distribution  as  well  as  prompt  neutron  and  gamma  emission 
brought  out  the  importance  of  nuclear  shells. 

Several  theoretical  approaches  were  made  incorporating  shell  effects,  prominent 
among  them  being  the  statistical  model  (Fong  1956),  and  the  nucleon  exchange  model 
(Ramanna  1964).  As  discussed  by  Kapoor  and  Ramamurthy  in  a  paper  to  this  volume, 
nucleon  exchange  model  can  explain  one-,  two-,  and  three-peaked  mass  distributions  in 
terms  of  exchange  of  nucleons  taking  into  account  the  influence  of  fragment  nuclear 
shells.  An  important  breakthrough  came  from  the  work  of  Myers  and  Swiatecki  (1967) 
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and  Strutinsky  (1967).  Incorporation  of  single  particle  effects  over  and  above  that  of 
liquid  drop  potential  by  these  groups  resulted  in  a  double-humped  fission  barrier  in 
contrast  to  a  single-humped  barrier  resulting  from  ldm  consideration.  Moller  and 
Nilsson  (1970)  showed  that  in  the  vicinity  of  the  second  saddle  point,  asymmetric 
deformation  of  the  fissioning  nucleus  results  in  the  lowering  of  the  fission  barrier,  which 
provided  a  qualitative  explanation  for  the  asymmetric  mass  distribution  in  terms  of  the 
double-humped  fission  barrier.  The  effect  of  washing  out  of  nuclear  shell  effects  with 
excitation  energy  on  the  observable  quantities  like  the  fragment  angular  distribution 
and  fission  probabilities  in  the  case  of  nuclei  with  double-humped  barrier  was  later 
worked  out  by  Ramamurthy  et  al  (1970).  Mustafa  et  al  (1973),  based  on  the  two-centre 
shell  model,  showed  that  the  shell  structure  in  the  fragments  influences  the  potential 
energy  surface,  explaining  qualitatively  the  mass  distribution  of  a  wide  range  of 
fissioning  nuclei.  More  recently,  Wilkins  et  al  (1976)  and  Prakash  et  al  (1979)  had 
attempted  to  explain  various  features  of  low  energy  fission  as  a  result  of  recognising  the 
importance  of  deformed  nuclear  shells  in  addition  to  spherical  shells  at  scission  in  their 
model  based  on  the  quasi-statistical  equilibrium  near  scission.  A  large  volume  of  results 
of  the  theoretical  and  experimental  studies  of  low-energy  fission  have  been  published  in 
a  number  of  excellent  reviews  and  proceedings  of  International  Conferences  (Hyde 
1966;  Vandenbosch  and  Huizenga  1974;  Proc.  of  the  IAEA  Symposia  on  Physics  and 
Chemistry  of  Fission;  1965,  1969,  1973  and  1979). 

At  Trombay,  work  on  nuclear  fission  was  started  by  Dr  Ramanna  and  it  is  because  of 
his  continued  interest  and  strong  support  that  basic  research  in  this  important  field 
could  be  pursued  and  significant  contributions  made.  Both  physical  and  radiochemical 
methods  have  been  employed  in  these  investigations. 

The  areas  of  fission  research  to  which  a  substantial  contribution  has  been  made  by 
the  Trombay  group  using  physical  techniques  such  as  ionization  chambers,  scintillation 
detectors,  semiconductor  detectors  and  time  of  flight  are  the  studies  of  fission  fragment 
characteristics  and  secondary  radiations  emitted  in  fission.  The  early  investigations 
concerned  the  mass,  kinetic  energy  and  angular  distributions  of  fission  fragments  and 
the  correlations  among  them  (Kapoor  et  al  1965;  Rekha  Govil  et  al  1983),  and  the 
emission  of  neutrons  (Ramanna  et  al  1961;  Kapoor  et  al  1963),  gamma  rays  (Kapoor 
and  Ramanna  1964)  and  K-x-rays  (Kapoor  et  al  1968, 1969, 1971;  Kataria  et  al  1970)  in 
coincidence  with  the  fission  fragments.  In  the  last  few  years,  a  comprehensive 
programme  to  study  the  light-charged  particles  in  fission  has  also  been  carried  out 
(Ajitanand  et  al  1975;  Choudhury  et  al  1976,  1980),  the  results  of  which  have  been 
reviewed  elsewhere  (Nadkarni  1982). 

Radiochemical  studies  on  the  fission  of  actinides  were  initiated  with  the  aim  of 
understanding  the  systematics  of  low  energy  fission,  particularly  distribution  of  mass, 
charge,  kinetic  energy  and  angular  momentum  covering  a  wide  range  of  elements.  The 
results  contributed  to  a  better  understanding  of  this  interesting  nuclear  reaction, 
particularly  the  role  played  by  both  the  spherical  and  deformed  nuclear  shells  and  the 
effect  of  nucleon  pairing.  In  this  article,  we  present  a  summary  of  the  Trombay 
radiochemical  work  highlighting  the  important  findings. 


2.  Mass  distribution 

At  Trombay,  mass  distribution  studies  were  carried  out  in  the  neutron-induced  fission 
of  a  number  of  actinides  ranging  from  221  Ac  to  245Cm.  These  studies  involved 
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radiochemical  separations  of  fission  products  followed  by  their  estimation  using  beta 
or  gamma  counting  or  direct  counting  of  catcher  foils  on  a  high  resolution  germanium 
spectrometer.  Several  methods  such  as  absolute  method,  relative  method  and 
comparison  method  (Jain  and  Ramaniah  1973)  were  employed  in  these  studies.  Some 
relevant  results  are  presented  in  table  1.  One  interesting  feature  which  emerged  from 
the  extensive  data  obtained  at  Trombay  and  from  the  literature  data  covering  a  wide 
range  of  fissioning  nuclei  from  2 27 Ac  to  259Md  is  the  transition  from  three-peaked 
distribution  in  the  Ac-Th-U  region  to  two-peaked  distribution  in  heavier  actinides  and 
a  single-peaked  mass  distribution  in  the  Fm-Md  region.  The  transition  from  three- 
peaked  to  two-peaked  distribution  in  Ac-Th  region  (Jensen  and  Fairhall  1958)  was 
explained  on  the  basis  of  two-mode  fission  hypothesis  put  forward  by  Turkevich  and 
Niday  (1951)  according  to  which  the  three-peaked  mass  distribution  is  a  result  of 
superposition  of  asymmetric  and  symmetric  mass  distributions  and  transition  to  an 
asymmetric  mass  distribution  occurs  rather  sharply  in  going  from  actinium  to  thorium. 

Neutron-induced  fission  of  232Th  (Iyer  et  al  1963)  was  studied  at  Trombay  and  a 
small  symmetric  peak  was  also  observed  besides  the  two  large  asymmetric  peaks 
indicating  that  the  extent  of  symmetric  contribution  may  not  strongly  depend  on  the 
proton  number  of  the  fissioning  nucleus.  This  was  further  confirmed  by  the  mass 
distribution  in  the  neutron  fission  of  232 U  (Manohar  et  al  1979)  which  showed  a 
symmetric  peak  similar  to  that  in  the  neutron  fission  of  232Th.  The  dependence  of 
symmetric  contribution  on  proton  and  neutron  number  of  the  fissioning  nucleus  was 
explained  on  the  basis  of  the  influence  of  variation  of  the  difference  between  barrier 
heights  for  symmetric  and  asymmetric  modes  (Es-Ea)  obtained  from  the  work  of 
Moller  and  Nix  (1974)  as  a  function  of  proton  and  neutron  number  as  shown  in  figures 
la  and  lb.  It  is  seen  that  (Es-Ea)  varies  more  sharply  with  the  neutron  number  than  the 
proton  number  of  the  fissioning  nucleus  and  the  contribution  of  the  symmetric 
component  increases  as  (Es-Ea)  decreases.  The  observability  of  the  symmetric  peak  in 
232U  could  be  understood  as  due  to  a  decrease  in  (Es-Ea)  with  decrease  in  the  neutron 
number  from  143  to  141,  which  more  than  compensates  for  the  increase  in  (Es-Ea)  with 
increase  in  the  proton  number  from  90  to  92. 


Table  1.  The  mass  distribution  characteristics  for  different  fissioning  nuclei  studied  at 
Trombay 


Fissioning 

nucleus 

Most  probable 

Light 

mass 

Heavy  ] 

FWTM 

P/V 

Remarks 

228Ac 

88-0 

138-0 

18-0 

42 

Three-peaked  *fl) 

23°Th 

88-5 

139-5 

15-8 

261 

Three-peaked <b) 

233Tb 

930 

137-5 

21-0 

111 

Three-peaked  *c) 

232pa 

94-2 

139-8 

20-5 

97 

Two- peaked*40 

233U 

94  1 

138-9 

210 

480 

Three-peaked*'0 

234u 

950 

139-0 

21-9 

330 

Three-peaked  *a) 

236U 

95-5 

139-5 

21-2 

620 

Two-peaked  *e) 

238  Np 

980 

140-0 

23-8 

140 

Two-peaked(a) 

24°pu 

100-4 

139-6 

24-0 

200 

Two-peaked* 

242pu 

102-6 

139-4 

24-0 

600 

Two-peaked*4’1 

246Cm 

105-3 

140-7 

25-0 

155 

Two-peaked  *tf) 

0  Iyer  et 

al  (1965;  b  Singh  et  al  (1982);  c 

Iyer  et  al  (1963);  d 

Manohar 

et  al  (1979);  e  Chitambar 

(1984);  f 

Jain  et  al  (to  be  published  in  Radiochimica  Acta)\ 

9  Ramaswami  et  al  (1979). 
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Figure  2.  Variation  of  peak  to  valley  ratio  as  a  function  of  mass  of  fissioning  nucleus  in  low 
energy  fission  (Manohar  et  al  1979). 


Figure  3.  Mass  yield  curve  in  reactor  neutron  induced  fission  of  238U  extrapolated  beyond 
A  _  70  on  lighter  wing  side  and  A  =  161  on  heavier  wing  side.  Experimental  points  show  the 
presence  of  new  “shoulders”  or  “humps”  in  highly  asymmetric  region  due  to  28  proton  shell 
effect  (Rao  et  al  1979). 
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This  dependence  of  (Es-Ea)  on  proton  and  neutron  number  of  the  fissioning  nucleus 
was  also  able  to  explain  the  variation  of  the  peak-to-valley  ratio,  P/V  (the  ratio  of  the 
highest  yield  to  the  yield  corresponding  to  the  symmetric  mass  split),  with  mass  number 
of  the  fissioning  nucleus  as  shown  in  figure  2.  It  can  be  seen  that  236U  has  the  highest 
P/V  ratio,  corresponding  to  the  maxima  in  (Es-Ea)  around  Z  —  92  and  N  =  146.  For 
fissioning  nuclei  heavier  than  236U,  (Es-Ea)  decreases  again,  but  the  symmetric  peak  is 
not  observed  any  more  as  it  is  obscured  by  the  broadening  of  the  mass  distribution  and 
the  shift  in  the  position  of  light  wing  towards  heavier  mass. 

Another  interesting  finding  at  Trombay  is  the  influence  of  28  proton  shell  in 
enhancing  the  yields  of  fission  products  at  highly  asymmetric  mass  region  66-67  and 
the  complementary  masses  as  shown  in  figure  3  (Rao  et  al  1979).  The  measured  yields  in 
the  range  10~3-10~6  %are  about  1-3  orders  higher  than  those  expected  from  a  smooth 
curve. 


3.  Kinetic  energy  distribution 

A  major  part  of  energy  released  in  low  energy  fission  appears  in  the  form  of  fragment 
kinetic  energy.  Studies  on  kinetic  energy  distribution  provide  information  on  the 
influence  of  fragment  nuclear  shells,  fragment  deformation  and  scission  point 
configuration  of  the  fissioning  nucleus.  Extensive  studies  have  been  carried  out  in  this 
area  using  physical  methods  and  time-of-flight  measurements.  The  physical  methods 
have  limited  sensitivity  in  the  low  fission  yield  region  and  in  the  case  of  target  nuclei 
with  high  alpha  specific  activity.  Due  to  their  high  sensitivity,  radiochemical  methods 
have  been  widely  used  particularly  where  physical  methods  have  limitations.  In  the 
radiochemical  methods,  kinetic  energies  of  fission  products  are  calculated  from  the 
measured  recoil  ranges  in  a  suitable  stopping  medium  such  as  Al,  air,  etc  using  empirical 
range-energy  equations.  The  kinetic  energies  of  fission  products  are  converted  into 
those  of  fission  fragments  by  appropriate  corrections  for  neutron  evaporation.  This 
procedure  gives  accurate  kinetic  energy  values  when  range-energy  equations  are 
calibrated  using  kinetic  energy  data  obtained  by  physical  methods  on  high  yield  fission 
products. 

Kinetic  energy  distribution  in  low  energy  fission  shows  a  maximum  around  mass  1 32, 
but  not  at  the  symmetric  mass  split  as  expected  from  the  ldm.  The  difference  between 
maximum  kinetic  energy  and  the  kinetic  energy  corresponding  to  symmetric  split 
known  as  kinetic  energy  deficit  (ked),  is  explained  on  the  basis  of  compact  scission 
configuration  of  the  fissioning  nucleus  for  mass  split  corresponding  to  132  (due  to  the 
presence  of  the  doubly  magic  spherical  shell  with  Z  =  50  and  N  =  82)  resulting  in  a 
shorter  distance  ( D )  between  charge  centres  of  the  complementary  fragments  compared 
to  that  for  a  symmetric  mass  split. 

Kinetic  energy  distribution  in  the  neutron  fission  of  232Th,  232U,  233U,  237Np, 
239Pu,  241  Pu,  241  Am  and  245Cm  was  studied  leading  to  estimates  of  ked  and  the 
average  total  kinetic  energy  (tke)  (Satya  Prakash  et  al  1969, 1972a,  b;  Dange  et  al  1975; 
Ramaswami  et  al  1977).  A  linear  correlation  was  found  to  exist  between  ked  and  the 
difference  of  fragment  mass  for  symmetric  split  from  the  doubly  magic  number  1 32  as 
shown  in  figure  4.  An  extrapolation  of  this  correlation  indicates  that  ked  for  the  fission 
of  264Fm  tends  towards  zero  and  this  trend  was  confirmed  experimentally  in  the  fission 
of  258Fm  and  259Fm  (Flynn  et  al  1978;  Ragaini  et  al  1974). 
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Figure  4.  Correlation  of  kinetic  energy  deficit  (iced)  with  shell  structure  (Satya  Prakash  et  al 
1969). 
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Figure  5.  Variation  of  relative  deformation  parameter  with  mass  split  for  fission  of  237  Np 
(Dange  et  al  1975). 


The  kinetic  energy  data  from  the  large  number  of  fissioning  nuclei  mentioned  above 
was  used  to  calculate  the  relative  deformation  and  deformation  parameters  of  various 
fission  products  to  examine  the  dependence  of  deformation  parameter  on  the 
fissionability  parameter  X.  For  this  purpose  the  D  values  for  various  mass  splits  were 
first  calculated  from  the  experimental  values  of  kinetic  energies.  The  D  values  for  a 
particular  mass  split  was  then  apportioned  between  the  complementary  fragments  on 
the  basis  of  the  proximity  of  their  neutron  and  proton  number  to  the  nearest  closed 
shell.  Figure  5  shows  a  typical  plot  of  deformations  of  fragments  as  a  function  of  their 
masses.  It  was  found  that  fragment  deformation  increases  with  increase  in  X  for 
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MASS  NUMBER 

Figure  6.  Variation  of  prompt  neutron  multiplicity  (v)  with  mass  number;  curve  • . • 

is  experimental  curve  (Apalin  et  al  1 964);  A - A  is  calculated  curve  obtained  using  mass 

formula  of  Myers  and  Swiatecki  1965  and  calculated  • - •  curve  using  mass  formula  of 

Wing  and  Fong  1964. 


symmetric  mass  division,  but  does  not  change  appreciably  for  asymmetric  mass 
division.  This  difference  could  be  explained  on  the  basis  of  cylindrical  shape  of  saddle 
point  for  nuclei  with  X  ^  07  which  takes  longer  time  to  arrive  at  scission  point  with 
considerable  stretching  of  the  neck  with  increasing  X.  Events  leading  to  symmetric 
splits  seem  to  be  mainly  governed  by  liquid  drop  behaviour  of  the  fissioning  nucleus 
while  asymmetric  splits  are  influenced  by  fragment  shells  and  hence  remain  insensitive 
to  the  variation  of  X.  From  the  calculated  fragment  deformation,  the  total  experimental 
kinetic  energy  and  the  calculated  Q  values,  the  deformation  energies  of  the  fragments 
were  calculated  leading  to  estimates  of  the  prompt  neutron  multiplicities  as  a  function 
of  fragment  mass.  Typical  results  for  thermal  neutron  fission  of  235U  are  shown  in 
figure  6. 


4.  Charge  distribution 

Studies  on  charge  distribution  provide  considerable  insight  into  the  influence  of 
nuclear  shells  and  nucleon  pairing  and  dynamical  aspects  of  descent  of  the  fissioning 
nucleus.  The  isobaric  and  isotopic  yield  distributions  are  of  Gaussian  nature 
characterised  by  the  most  probable  charge,  Zp  or  mass  Ap  and  the  corresponding  width 
parameters,  oz  or  o A. 

Experimental  investigations  on  charge  distribution  involve  determination  of  the 
fractional  cumulative  yields  (fcy)  or  fractional  independent  yields  (fiy)  of  isobaric 
fission  products  or  conversion  of  the  independent  yields  of  fission  products  to 
fractional  isotopic  yields  making  use  of  elemental  yields.  The  physical  methods  involve 
on-line  measurements  of  x-ray  or  y-ray  intensities  in  coincidence  with  kinetic  energies 
of  pairs  of  complementary  fission  products  or  use  of  mass  separators,  while 
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radiochemical  methods  are  off-line  and  involve  separation  of  fission  products  of 
interest  and  their  assay  by  fi/y  counting  or  direct  gamma  ray  spectrometry.  The 
necessity  of  the  determination  of  the  independent  yields  of  fission  products  of  short 
half-lives  was  a  limitation  of  radiochemical  methods  in  the  past.  However,  the  advent  of 
fast  and  automated  transport  and  radiochemical  separation  techniques  during  the  last 
decade  has  attenuated  this  problem. 

In  our  laboratories,  isobaric  charge  distribution  studies  have  been  carried  out  in  the 
mass  region  100-105  and  130-140  in  the  neutron  fission  of  229Th,  239Pu,  241  Pu  and 
245 Cm  (Rattan  et  al  1983;  Datta  et  al  1980;  Ramaswami  et  al  1982)  and  in  the 
spontaneous  fission  of  252Cf  (Manohar  et  al  1978;  Srivastava  et  al  1984).  In  addition, 
investigations  have  also  been  carried  out  on  the  isotopic  yields  distribution  of 
technetium  in  the  neutron  fission  of  239Pu  and  spontaneous  fission  of  252Cf  and  of 
iodine  isotopes  in  252Cf(sF)  and  in  30  and  40  MeV  alpha-induced  fission  of  232Th 
(Reddy  et  al  1984). 

Spontaneously  fissioning  nuclei  are  ideal  to  understand  the  role  of  potential  energy 
surface  of  fissioning  nucleus  and  fragment  nuclear  structure  on  charge  distribution. 
The  fcy  or  fiy  of  a  number  of  isobars  obtained  in  our  laboratory  along  with  the  data 
from  literature  (Wahl  1983)  were  analysed  to  obtain  the  values  for  Zp  and  oz  for  mass 
chains  105,  131,  139,  140  and  141.  It  was  found  that  the  Gaussian  distribution  with 
oz  —  0*60  +  0-06  represents  most  of  the  yields  in  252Cf  as  shown  in  figure  7.  It  was 


Z-Zp 

Figure  7.  A  plot  of  fractional  cumulative  yields  (fcy)  as  a  function  of  (Z-Zp)  in  the 
spontaneous  fission  of  252Cf  (Srivastava  et  al  1984;  Srivastava  1984). 
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further  observed  that  in  the  mass  region  1 30—1 35,  a z  value  is  on  the  lower  side  (i.e.  az 
—  0-50  to  055)  indicating  sharper  charge  distribution  in  this  mass  region,  apparently 
due  to  preferential  formation  of  82  shell  fragments.  An  estimate  of  proton  pairing  effect 
(±  12%)  was  obtained  based  on  our  data  on  elemental  yield  of  iodine  and  literature 
data  on  other  elements  in  the  spontaneous  fission  of  252Cf. 

From  the  data  presented  in  figure  8  for  252Cf,  it  is  seen  that  the  deviation  of  Zp  from 
ZUCD  (AZ)  increases  with  mass  asymmetry  as  expected  from  the  minimum  potential 
energy  hypothesis.  From  a  similar  analysis  of  variation  of  AZ  with  mass  asymmetry  in 
the  neutron  fission  of233U,  235U,  239Pu  and  249Cf  (Wahl  1983;  Mariolopoulos  1981),  it 
was  noted  that  the  rate  of  increase  of  AZ  with  mass  asymmetry  increases  with 
fissionability  parameter  as  shown  in  table  2.  With  increasing  fissionability,  the  saddle 
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Figure  8.  Variation  of  charge  polarization  with  fragment  mass  in  the  spontaneous  fission  of 
252Cf  (Srivastava  et  al  1984). 


Table  2.  Charge  polarization  as  a  function  of 
mass  asymmetry  (AZ/AM)  in  different  fissioning 
nuclei. 


Fissioning 

nucleus 

AZ/AM 

Fissionability 
parameter  X 

236u 

0-010  ±0-005 

0-772 

234u 

0-01 5  ±0-005 

0-774 

240pu 

0-01 5  ±0-005 

0-790 

250Cf 

0-025  ±0-002 

0-824 

252Cf 

0-036  ±  0-006 

0-821 

* 


X  = - 

50-88 


z2a 


1  -1-78261 


A  —  2Z 


where  A  and  Z  are  the  mass  and  charge  of  the 
fissioning  nucleus. 
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point  shapes  becomes  more  compact  (as  pointed  out  in  §  3)  and  the  nuclear  viscosity 
increases  making  the  time  of  descent  longer,  permitting  higher  charge  polarization  (A  Z) 
to  achieve  preferred  configurations. 

The  elemental  yields  for  technetium  (Z  =  43)  in  the  neutron  fission  of  239Pu  and 
spontaneous  fission  of  252Cf  were  deduced  from  the  isotopic  yield  distribution  data. 
F  rom  a  comparison  of  these  data  with  the  data  in  the  neutron  fission  of  235  U,  it  is  seen 
that  technetium  yield  increases  in  the  order  of  fissioning  nuclei  236U  <  240Pu  <  252Cf 
as  shown  in  table  3.  The  same  table  also  shows  that  while  the  Z  of  complementary 
fragment  recedes  from  magic  shell  closure  at  Z  =  50  from  236U  to  252Cf,  the  neutron 
number  of  complementary  fragment  approaches  spherical  neutron  shell  at  N  =  82  in 
the  fissioning  nuclei  240Pu  and  the  deformed  neutron  shell  at  N  =  88  in  252Cf.  It  is  also 
seen  that  the  neutron-to-proton  ratio  of  the  complementary  fragment  lies  close  to  that 
of  the  fissioning  nucleus  as  one  approaches  252Cf  indicating  that  shell  effect  is  most 
strongly  manifested  when  it  is  in  phase  with  liquid  drop  energetics. 

The  investigation  of  isotopic  yields  of  iodine  (Z  =  53)  in  30-40  MeV  a-induced 
fission  of  232Th  throws  some  light  on  the  extent  of  mass/charge  equilibration  in  fission. 
Table  4  shows  the  data  on  isotopic  yield  distribution  parameter,  Ap  and  o A  as  obtained 
in  232Th  (a3(>40Mev,/)and  the  same  based  on  literature  data  in  235U(«th  f).  With  increase 
in  excitation  energy  the  Ap  value  approaches  the  value  expected  from  unchanged  charge 
distribution  hypothesis.  The  increase  of  o A  with  excitation  energy  was  interpreted  to  be 
due  to  influence  of  temperature  on  mass/charge  equilibration,  after  taking  into  account 
multichance  fission,  indicating  that  with  increase  of  excitation  energy  the  time  of 
descent  is  shorter  permitting  less  time  for  acquiring  the  preferred  configuration  as  seen 
at  lower  excitation  energies. 


Table  3.  Most  probable  neutron  ( Np )  values  for  charge  splits  involving  technetium  and  its 
complementary  products  in  235U(nth,/),  239Pu(nth,/)  and  252Cf(SF). 


Fissioning 

nucleus 

Element 

percent 

Q* 

236u 

Technetium  (43) 
Indium  (49) 

10914±O14 
126-86  ±01 4 

66-14±0-14 
77-86  ±0-1 4 

0-092  ±0-024 

1-017 

24°pu 

Technetium  (43) 
Antimony  (51) 

108-35  ±0-45 
131-65  ±0-45 

65-35  ±0-45 
80-65  ±0-45 

8  00  ±1-32 

1-022 

252Cf 

Technetium  (43) 
Cesium  (55) 

109-10±0-50 
142-90  ±0-50 

66-10±0-50 
87-90  ±0-50 

16-90  ±2-09 

1-017 

*  Q  —  (Np/Z)fragment/(^/Z)fissioning  nucleus- 


Table  4.  Iodine  isotopic  yield  distribution  parameters. 


Fissioning  system 

Excitation 
energy  (MeV) 

Ap  value 

<7/4  expt. 

<7^  Calc. 

(in  236U*) 

235U  +  nth 

6-54 

135-9 

121  ±0*12 

1  30 

232Th  +  a30  Mey 

23-4 

135  09 

l-60±0-16 

1-51 

232Th  +  OC40  MeV 

33-2 

134-8 

2-37  ±0-24 

1-71 
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5.  Angular  momentum 

Measurement  of  angular  momentum  of  fission  fragments  provide  information  on 
scission  configuration  and  the  dynamics  of  motion  past  the  second  saddle  as  fragments 
acquire  their  angular  momenta  due  to  mutual  coulombic  torque  for  axially  asymmetric 
scission  configuration  and  thus  depend  on  fragment  deformation.  Among  the  various 
methods  for  the  estimation  of  fragment  angular  momenta,  physical  methods  have 
certain  limitations,  e.g .  measurements  of  prompt  gamma  angular  anisotropy  provide 
only  the  gross  estimate  of  average  angular  momentum  and  measurements  on  prompt 
gamma  multiplicity  and  energy  are  dependent  on  the  choice  of  various  parameters. 
Radiochemical  method  is  the  most  suitable  as  it  provides  fragment  angular  momenta  as 
a  function  of  their  mass  and  charge  and  has  much  less  number  of  adjustable  parameters. 
The  radiochemical  method  involves  determination  of  the  independent  yield  ratios  of 
the  fission  product  isomers  followed  by  statistical  model  based  analysis  (Vandenbosch 
and  Huizenga  1960)  to  correct  for  fragment  de-excitation  to  arrive  at  fragment  angular 
momenta. 

At  Trombay  angular  momenta  of  fragments  corresponding  to  the  fission  products 
isomeric  pairs  95Nb  and  1 32I  in  the  neutron  fission  of  23 3U, 1 31» 1 33Te  in  neutron  fission 
of  24 1  Pu  and  of  1 1 1  Pd  and  1 1 7Cd,  1 3 1 » 1 3 3Je  and  1 34I  in  spontaneous  fission  of  2  5 2Cf 
have  been  estimated  (Datta  et  al  1982;  Guin  et  al  1983;  Data  1984).  Studies  have  also 
been  carried  out  for  131Teand  133Tein  the  30  to  40  MeV  alpha  induced  fission  of232Th 
(Datta  et  al  1983). 

Results  of  investigation  in  the  neutron  fission  of  233U  (as  given  in  figure  9)  showed 


Figure  9.  Plot  of  fragment  angular  momentum  vs  fragment  atomic  number  in  233U(n  ^ 
(Datta  et  al  1982). 
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that  angular  momentum  of  fragments  around  doubly  magic  shell  region  132  (Z  =  50, 
N  =  82)  decreases  as  corresponding  atomic  number  deviates  from  50  proton  spherical 
shell  contrary  to  expectation.  This  apparent  anomaly  is  explained  on  the  basis  of  the 
presence  of  the  deformed  shell  (Z  =  38)  in  the  complementary  fragment  that  tends  to 
reduce  the  deformation,  thus  lowering  the  angular  momentum  of  the  corresponding 
heavy  fragment  as  atomic  number  of  the  latter  approaches  Z  =  54  from  Z  =  50. 

In  the  spontaneous  fission  of  252Cf,  it  was  observed  that  there  exists  an  inverse 
correlation  between  fragment  angular  momentum  and  elemental  yield  with  odd  Z 
fragments  having  high  angular  momentum  and  lower  yield  compared  to  neighbouring 
even  Z  elements  as  shown  in  figure  10.  This  implied  that  for  fragments  with  higher 
angular  momentum,  larger  amount  of  energy  remains  tied  up  as  rotational  energy 
reducing  the  intrinsic  excitation  energy  and  in  turn  their  yield  according  to  the 
statistical  model.  Further,  for  odd  Z  elements  high  angular  momenta  indicates  higher 
scission  point  deformation  compared  to  their  even  Z  neighbours  in  the  magic  shell 
region  arising  due  to  polarization  of  even-even  core  of  protons  by  the  odd-proton  since 
in  the  spherical  shell  region  of  N  =  82  nuclear  deformation  is  more  strongly  affected  by 
protons  than  neutrons  as  has  also  been  concluded  from  recent  theoretical  calculations 
(Madsen  and  Brown  1984). 

The  observations  on  fragment  angular  momenta  in  the  252Cf(sF)  show  that  the 
fragment  angular  momenta  are  correlated  to  the  scission  point  deformation  dependent, 
in  turn,  upon  fragment  nuclear  structure.  As  a  consequence  one  would  expect  a 
correlation  between  fragment  angular  momentum  and  prompt  neutron  number. 
However,  lack  of  any  such  correlation  as  observed  has  been  explained  to  be  due  to  the 


Figure  1 0.  Correlation  of  elemental  yield  and  angular  momentum  in  2  5  2Cf  (Datta  et  al  to  be 
published). 


150 


M  V  Ramaniah 


fact  that  the  prompt  neutron  number  depends  on  total  excitation  energy  of  fragment 
after  shape  relaxation. 

Fragment  deformation  at  scission  has  been  deduced  based  on  simple  theoretical 
calculation  in  terms  of  bending  mode  oscillation  of  the  fissioning  nucleus  and  statistical 
model  (figure  11).  The  Bohr-Mottelson  deformation  parameters  (/?)  for  various 
fragments  in  the  spontaneous  fission  of  252Cf  could  be  obtained.  The  values  for  ^Pd 
(N  =  66)  and  ^JXe  (N  —  88)  are  063  and  072  respectively  which  are  theoretically 


PRE-SCISSION  CONFIGURATION  Ineck  radius  c) 


\ 


ONSET  OF  BENDING  OSCILLATION 


NON-LINEAR  SCISSION  CONFIGURATION 
(  SENDING  AMPLITUDE  r,  .  r2  1 


Figure  11.  Schematic  of  origin  of  fragment  angular  momentum  due  to  precision  bending 
mode  oscillations. 


Table  5.  Comparison  of  fragment  angular  momentum  in  235U  at  various  excitation 
energies. 


Fissioning 

system 

Excitation 
energy  ( E *) 
(MeV) 

Initial  angular 
momentum 
<l2>1/2fi 

Fragment  angular 
momentum  (Bft) 

131Te  133Te 

Calculated 
B h  value 
for  133Te 

235U(nth,/) 

6-54 

3-4 

6- 0  ±1-5° 

7- 5  ±0-7b 

5- 9±l-5a 

6- 5±0-4b 

— 

232Th  (a3o  m  eV>/) 

23-36 

10-2 

5-7  ±0-6 

6-9  ±04 

7-03 

232Th (<*40  MeV>/) 

33-20 

144 

5-8  ±0-8 

7-9  ±0-8 

7-81 

a  Sarantites  et  al  1965. 
b  Imanishi  et  al  1976. 
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predicted  deformation  parameters  for  occurrence  of  deformed  shells  at  these  neutron 
numbers,  in  low  energy  fission.  Investigations  on  30  to  40  MeV  alpha  induced  fission  of 
232Th  indicated  that  the  angular  momenta  of  fragments  in  the  magic  shell  region 
remains  unchanged  with  excitation  energy  as  shown  in  table  5.  These  observations  have 
been  interpreted  in  terms  of  interplay  between  tendency  of  increase  in  angular 
momenta  at  higher  excitation  energy  as  against  stiffness  of  compound  nucleus  at  higher 
angular  momentum  towards  bending  oscillation  and  shorter  time  of  descent.  In  order 
to  analyse  the  influence  of  the  fissioning  nucleus  excitation  energy  and  angular 
momentum,  calculations  were  carried  out  for  evaluation  of  fragment  angular  momenta. 
This  calculation  was  based  on  retainment  of  angular  velocity  of  Fermi  gas  type 
fissioning  nucleus  at  saddle  point  and  population  of  higher  rotational  states  at  higher 
excitation  energy  taking  into  account  the  effect  of  multichance  fission.  The  agreements 
between  calculated  and  observed  angular  momenta  indicates  that  at  higher  energy 
fragment  angular  momentum  is  primarily  dependent  on  the  properties  of  fissioning 
nucleus  at  the  saddle  point. 


6.  Conclusion 

Radiochemical  studies  of  fission  carried  out  at  Trombay  during  the  last  25  years  have 
contributed  to  a  better  understanding  of  this  highly  interesting  and  complex  nuclear 
reaction.  Extensive  investigations  on  mass  distribution  have  shown  that  the  one-,  two-, 
and  three-peaked  mass  distribution  in  low  energy  fission  arise  due  to  the  relative 
contributions  of  the  symmetric  and  asymmetric  modes  of  fission  which,  in  turn,  depend 
on  the  difference  in  the  barrier  heights  for  these  modes.  It  has  been  further  shown  that 
this  difference  strongly  depends  on  the  neutron  number  than  the  proton  number  of  the 
fissioning  nucleus  and  further,  the  variation  of  this  difference  with  mass  number  of  the 
fissioning  nucleus  explains  the  observed  variation  in  the  peak-to-valley  ratio  with  the 
fissioning  nucleus.  Enhanced  fission  yields  in  the  highly  asymmetric  mass  could  be 
understood  as  due  to  the  influence  of  the  28  p-shell.  Results  of  kinetic  energy 
distribution  demonstrated  the  influence  of  the  doubly  magic  shell  at  mass  1 32  on  the 
total  kinetic  energy  and  kinetic  energy  distribution.  The  fragment  deformation 
calculated  from  the  kinetic  energy  data  has  clearly  shown  that  the  symmetric  mode  is 
largely  governed  by  the  liquid  drop  behaviour.  Results  of  studies  on  charge  distribution 
and  fragment  angular  momentum  brought  out  the  influence  of  66  and  88  neutron 
deformed  shells  and  further  showed  their  greater  effectiveness  when  in  phase  with 
liquid  drop  energetics.  It  was  also  shown  that  the  extent  of  variation  of  charge 
polarisation  with  mass  asymmetry  depends  on  the  fissionability  parameter  due  to 
the  influence  of  the  saddle  point  shape  and  dynamics  of  descent.  Angular  momentum 
studies  provided  estimates  of  fragment  deformation  and  showed  that  the  odd  Z 
fragments  have  high  deformation  due  to  the  polarisation  of  even  Z  core  by  the  odd 
proton.  Results  of  studies  on  medium  energy  fission  have  shown  the  influence  of  the 
saddle  point  properties  on  charge  distribution  and  fragment  angular  momentum  due  to 
higher  excitation  energy  and  shorter  time  of  descent.  Thus,  our  studies  have 
contributed  to  a  better  understanding  of  the  nuclear  fission  process  particularly 
concerning  the  late  stage  of  fission  process  and  have  brought  out  the  importance  of  the 
deformation  energy  surface,  spherical  and  deformed  nuclear  shells,  nucleon  pairing  and 
the  dynamics  of  descent  past  the  second  saddle  point. 
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Studies  of  light-charged  particle  emission  in  fission  at  Trombay 
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Bhabha  Atomic  Research  Centre,  Trombay,  Bombay  400085,  India 

Abstract.  Studies  of  prompt  radiations  emitted  in  fission  were  started  at  Trombay  in  the  late 
1950’s  by  Dr  R  Ramanna  and  over  the  years  extensive  investigations  on  the  emission  of  prompt 
neutrons,  gamma  ray  and  K  x-rays  in  fission  were  carried  out  with  neutron  beams  from  apsara 
and  cirus  reactors.  In  the  early  1960’s  studies  on  the  emission  of  light-charged  particles  in 
fission,  which  is  a  rare  mode  of  fission,  were  also  started.This  paper  reviews  some  of  the  recent 
studies  on  the  emission  of  light-charged  particles  (lcp)  in  fission  which  were  carried  out  with  a 
view  to  investigate  the  mechanism  of  lcp  emission,  the  scission  configuration  and  the  dynamics 
of  the  last  stages  of  the  fission  process. 

Keywords.  Nuclear  fission;  light-charged  particle  emission;  emission  probability;  energy- 
angular-distribution. 

PACS  No.  25  85 


1.  Introduction 

Nuclear  fission  process,  which  involves  collective  motion,  large  scale  deformation  and 
subsequent  division  of  the  fissioning  nucleus  into  two  fragments  has  provided  over  the 
years  a  convenient  way  to  unfold  macroscopic  nuclear  behaviour  and  to  study  nuclear 
dynamics.  Only  in  recent  years  with  the  availability  of  energetic  heavy  ion  beams  the 
studies  of  nucleus-nucleus  collision  have  provided  a  complementary  and  perhaps  a 
more  powerful  way  to  investigate  the  domains  of  nuclear  macrophysics  which  could 
earlier  be  probed  only  through  the  study  of  fission  process.  In  the  fisson  process  once 
the  nucleus  is  given  sufficient  energy  to  reach  the  top  of  the  fission  barrier  (saddle  point) 
it  continues  to  elongate  until  it  splits  into  two  fragments  at  what  we  call  “scission 
point”.  The  deformed  fragment  nuclei  formed  at  scission  convert  almost  instantly  their 
deformation  energies  into  excitation  energy  via  damped  vibrations  around  their  stable 
shapes.  Simultaneously,  the  resulting  excited  fragments  undergo  acceleration  under  the 
mutual  Coulomb  repulsion  and  acquire  their  full  kinetic  energies  in  a  very  short  time 
10"  20  sec)  after  scission.  During  the  de-excitation  of  the  fragments  most  (~  90%) 
of  the  prompt  neutrons  are  emitted  after  the  fragments  have  reached  their  full  kinetic 
energies.  The  excitation  energy  below  the  neutron  binding  energy  is  given  off  as  prompt 
gamma  rays  via  gamma  transitions  cascading  down  from  the  excited  states  to  the 
ground  state  of  the  emitting  fragment.  A  small  fraction  (about  6  %)  of  these  transitions 
undergo  internal  conversion  resulting  in  the  emission  of  conversion  electrons  and  the 
characteristic  x-rays  of  the  fragments.  Occasionally,  along  with  the  two  heavier 
fragment  nuclei  the  fission  process  also  results  in  the  emission  of  a  light-charged 
particle  (lcp)  and  this  process  is  called  “LCP-accompanied  fission”  or  simply  as  “ternary 
fission”.  As  the  bulk  of  these  light  particles  are  long  range  alpha  particles  (lra)  this 
process  is  also  often  referred  to  as  “LRA-accompanied  fission”  particularly  when 
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measured  without  particle-identification  of  the  lcp.  As  this  process  has  a  very  low 
probability  of  occurrence  of  about  2  x  10“ 3  as  compared  to  the  normal  binary  fission, 
detailed  investigations  of  this  process  have  been  possible  only  for  thermally  fissile 
nuclei  with  the  use  of  high  flux  reactors  or  for  the  spontaneous  fission  of  252Cf. 

Investigations  on  the  nuclear  fission  process  were  initiated  at  Trombay  by 
Dr  R  Ramanna  in  the  late  1950’s  and  over  the  years  considerable  work  has  been  carried 
out  at  Trombay  with  neutron  beams  from  apsara  and  cirus  reactors  on  the  various 
aspects  of  the  fission  process  using  physical  and  radiochemical  techniques.  The  work 
using  radiochemical  techniques  has  been  reviewed  in  another  article  by  Dr  Ramaniah  in 
this  volume.  Among  the  earliest  investigations  in  fission  using  physical  techniques  at 
Trombay  were  the  measurements  of  the  energy  spectrum  of  the  neutrons  and  neutron- 
fragment  angular  correlations  in  thermal  neutron  induced  fission  of  235 U  (Ramanna  et 
al  1961;  Kapoor  et  al  1963).  The  neutron-fragment  angular  correlations  showed  that  a 
small  fraction  of  neutrons  ( ~  10  %  of  total)  does  not  originate  from  the  de-excitation  of 
the  moving  fragments  and  is  emitted  isotropically  in  the  laboratory  system  with  an 
evaporation-like  energy  spectrum.  In  this  work,  emission  spectrum  of  the  neutrons 
emitted  from  fully  accelerated  fragments  was  shown  to  be  a  superposition  of  various 
evaporation  spectra  with  a  distribution  of  nuclear  temperatures  of  the  form  P(T) 
—  2 T /Tmi  with  a  maximum  temperature  Tm.  This  type  of  distribution  has  formed  the 
basis  of  several  recent  theoretical  calculations  of  the  fission  neutron  spectra  (Madland 
and  Nix  1982).  In  a  subsequent  study  the  prompt  gamma-fragment  angular  corre¬ 
lations  were  measured  which  showed  an  anisotropic  gamma  ray  emission  (anisotropy 
~  10-15  %)  suggesting  the  occurrence  of  large  angular  momenta  of  fission  fragments 
brought  about  possibly  by  the  non-axial  splitting  of  the  neck-region  of  the  fissioning 
nucleus  at  scission  (Kapoor  and  Ramanna  1964).  Several  studies  on  the  emission  of  K  x- 
rays  from  fission  fragments  in  spontaneous  fission  of  252Cf  and  thermal  neutron 
induced  fission  of  235 U  were  also  later  carried  out.  Among  these  were  the  measure¬ 
ments,  for  the  first  time,  of  the  multiplicities  of  K  x-rays  emitted  by  the  fission  fragments 
of  specified  atomic  numbers  and  the  first  and  second  moments  of  the  x-ray  emission 
distribution  function  (Kapoor  et  al  1971a,b).  The  average  yields  and  emission  times  of 
K  x-rays  emitted  from  individual  fission  fragments  were  determined  and  from  these  the 
onset  of  a  new  region  of  nuclear  deformation  corresponding  to  neutron  number 
N  >  88  was  established.  It  was  also  seen  that  there  exists  a  significantly  large 
probability  of  cascade  emission  of  K  x-rays  in  several  cases. 

Among  other  areas  of  fission  physics  research  undertaken  at  Trombay  were  studies 
of  fragment  angular  distributions  (Nadkarni  et  al  1968b),  correlation  between  fragment 
anisotropy  and  asymmetry  in  fast  neutron-induced  fission  (Kapoor  et  al  1965)  and 
correlation  between  fragment  mass,  charge  and  kinetic  energy  in  thermal  neutron 
fission  (Govil  et  al  1983).  We  do  not  intend  to  discuss  these  investigations  of  binary 
fission  process  here  but  rather  focus  on  some  of  the  studies  of  the  Lcp-accompanied 
fission  undertaken  at  Trombay. 

The  emission  of  lcp  in  fission  has  several  characteristic  features:  (i)  they  are  emitted 
with  a  very  low  probability  ( PLCP)  which  is  one  in  about  300  to  500  binary  fissions,  (ii)  a 
majority  of  these  lcp  are  isotopes  of  H  and  He,  with  almost  90%  of  them  being  long 
range  alpha  particles  (lra),  (iii)  they  are  emitted  with  a  broad  energy  spectrum  with  a 
near-Gaussian  shape,  which  ranges  from  5  to  30  MeV  with  a  peak  around  1 5  MeV  and 
fwhm  of  about  1 0  MeV  and  (iv)  they  are  emitted  nearly  perpendicular  to  the  direction  of 
fission  fragment  motion,  more  precisely  at  an  average  angle  UaL  of  about  83°  with 
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respect  to  the  direction  of  motion  of  the  light  fragment.  It  has  been  inferred  from  these 
characteristics  that  they  are  emitted  close  to  the  scission  stage  from  the  neck  region 
between  the  two  nascent  fission  fragments.  In  order  to  deduce  information  about  the 
scission  configuration  as  well  as  the  dynamics  of  the  fission  process  at  the  instant  of 
their  emission  during  the  last  stages  of  scission,  more  detailed  information  about  the 
correlations  among  the  various  measured  parameters  is  required.  In  the  early  1960’s 
Ramanna  and  coworkers  started  investigations  on  the  emisson  of  lra  in  fission  and  the 
first  study  in  this  series  was  measurements  of  the  angular  distribution  of  the  lra  and 
ternary  fission  fragments  in  14MeV  neutron  induced  fission  of  238U  (Ramanna  et  al 
1963).  Numerous  other  studies  of  the  Lcp-accompanied  fission  followed  in  the 
subsequent  years  with  the  aim  of  investigating  not  only  the  mechanism  of  emission  of 
these  particles  but  also  the  scission  configuration  and  the  nuclear  dynamics  of  the  last 
stages  of  the  fission  process,  using  these  lcp  as  probes.  In  what  follows,  some  of  the 
recent  investigations  on  the  Lcp-accompanied  fission  carried  out  at  Trombay  are  briefly 
reviewed. 


2.  Studies  of  lcp  emission  in  235U  (nth,/) 

With  a  view  to  study  the  effect  of  the  expected  sharp  variation  of  the  Coulomb  field 
around  the  inter-fragment  (neck)  region  on  the  energy  of  the  lra,  the  energy  spectra  of 
lra  emitted  at  several  angles  with  respect  to  the  fragment  direction  were  measured 
(Nadkarni  et  al  1968, 1972).  From  the  energy  spectra  observed  at  these  angles  the  width 
represented  by  oe  of  the  Gaussian-shaped  LRA-fragment  angular  correlation  function 
was  deduced  for  lra  of  different  energies.  These  values  of  o0  vs  lra  energy  Ea  are  shown 
in  figure  1  where  a  shallow  minimum  of  oe  at  Ea  ~  17  MeV  is  observed.  It  was  noticed 
that  the  yield  of  lra  does  not  decrease  to  a  very  low  value  at  such  small  angles  as  27°  or 
1 1  °  as  would  be  expected  for  a  true  Gaussian  distribution.  Most  of  the  lra  yield  at  these 
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Figure  1.  The  variation  of  oe  of  the  Gaussian  angular  correlation  function  with  alpha 
particle  energy  (Nadkarni  et  al  1972). 
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small  angles,  belonging  to  a  non-Gaussian  component,  shows  a  significantly  higher 
most  probable  energy  (22-24  MeV)  (Nadkarni  et  al  1972).  lra  with  energies  beyond 
26  MeV  were  found  to  be  emitted  preferentially  along  the  direction  of  the  fragments 
(Nadkarni  and  Rama  Rao  1968a).  This  non-Gaussian  lra  component,  which  has  been 
called  polar  lra  (Piasecki  et  al  1970)  was  attributed  to  evaporation  from  moving  fission 
fragments  which  then  explains  the  observed  most  probable  energies  at  these  small 
angles.  Estimates  of  TJTn  based  on  the  statistical  theory  showed  that  at  least  some  of 
the  ‘polar’  lra  may  be  accounted  for  as  evaporation  from  moving  fission  fragments 
(Nadkarni  et  al  1972). 

Considerable  insight  into  the  mechanism  of  lcp  emission  in  fission  as  well  as  of  the 
fission  process  itself  can  be  got  from  a  knowledge  of  the  kinetic  energy  EKo  of  the 
fragments  at  the  moment  of  liberation  of  the  light-charged  particles.  Considering  that 
lcp  are  liberated  near  the  scission  configuration,  the  quantity  EKo  also  corresponds  to 
the  prescission  kinetic  energy  in  the  normal  binary  fission  process.  The  magnitude  of 
EKq  in  the  fission  process  is  expected  to  be  dependent  on  the  degree  of  coupling  of  the 
collective  and  single  particle  degrees  of  freedom,  and  is  therefore  related  to  the 
magnitude  of  nuclear  viscosity  operating  during  the  descent  of  the  nucleus  from  saddle 
to  scission.  A  direct  determination  of  the  magnitude  of  EKo  acquired  by  the  fragments  in 
a  time  of  the  order  of  few  times  10~21  sec  is  not  possible.  But  it  has  been  realized  for 
some  time  that  the  lcp  can  serve  as  probes  to  gauge  this  quantity  through 
measurements  of  the  various  correlations  in  lcp  accompanied  fission  (Frenkel  1967; 
Mehta  et  al  1973). 

In  the  LRA-fission  process  several  variables  or  parameters  are  involved  such  as  the 
fission  fragment  mass  (M  f),  total  fragment  kinetic  energy  ( EK ),  lra  energy  (Island  lra- 
fragment  angle  9af.  Although  in  earlier  studies  (Fraenkel  1967;  Mehta  et  al  1973) 
correlation  of  and  EK  has  been  measured  and  analysed  using  trajectory  calculations, 
a  definite  result  on  the  magnitude  of  EKo  has  been  difficult  to  deduce  from  a  limited  set 
of  measured  correlations.  As  with  more  detailed  data  on  differential  correlation  among 
all  possible  LRA-fission  parameters  it  beomes  possible  to  obtain  more  accurately  the 
scission  parameters  and  hence  EKo,  detailed  multiparameter  correlation  measurements 
between  the  parameters  Mf,  Ek,  Ea  and  6af  in  thermal  neutron  induced  fission  of  235U 
were  carried  out  at  Trombay  with  cirus  reactor  (Choudhury  et  al  1980).  A  back-to-back 
gridded  ionization  chamber  (figure  2)  was  used  to  measure  kinetic  energies  of  the  pair 
fragments  and  the  angle  6af  was  measured  by  an  electronic  method  based  on  the 
measurement  of  coincident  pulse  heights  of  the  grid  and  the  collector  pulses 
(Choudhury  et  al  1979).  Pulse  heights  of  the  two  collectors,  the  two  grids  and  of  the  lra 
detector  placed  along  the  electric  field  direction  of  the  chamber  were  recorded  event-by¬ 
event  on  a  magnetic  tape  by  means  of  a  6-parameter  data  acquisition  system.  The  data 
were  analysed  to  obtain  Mf,  EK,  and  0aL  for  each  event  and  a  probability  matrix  P 
{Mf,  Ek,  Ea,  Et,  0al)  was  constructed,  where  total  kinetic  energy  released  is  ET  =  EK 
+  £a,  and  0aL  is  the  angle  between  lra  and  the  light  fission  fragment.  From  this  matrix, 
the  correlation  between  any  two  parameters  for  specific  windows  on  the  remaining 
parameters  were  obtained.  Various  correlations  of  interest  were  deduced  and  analysed 
in  this  work  and  we  show  here  only  a  few  selected  results.  Figure  3  shows  the  ternary 
fission  fragment  mass  distribution,  where  the  correction  due  to  alpha  particle  recoil  has 
been  incorporated,  together  with  the  binary  fission  fragment  mass  distribution.  The 
light  fragment  mass  peak  is  observed  to  be  shifted  to  the  lower  side  whereas  the  heavy 
fragment  peak  is  not  shifted  appreciably.  This  has  been  interpreted  in  terms  of  the 
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Figure  2.  Schematic  diagram  of  the  experimental  set  up  used  in  the  lra  correlation 
experiment  (Choudhury  et  al  1980). 


Figure  3.  Mass  distribution  of  the  fission  fragments  in  binary  and  lra  accompanied  fission 
(Choudhury  et  al  1980). 


dependence  of  the  alpha  particle  emission  probability  Pa  on  fragment  mass 
(Choudhury  et  al  1980).  Figure  4  shows  the  variation  of  the  most  probable  value  of  the 
total  kinetic  energy  £rand  oEj  with  heavy  fragment  mass  MH  in  lra  and  binary  fission, 
where  a  significantly  larger  oEt  is  observed  in  lra  accompanied  fission  for  MH  ~~  130 
(near  closed  shell  region)  whereas  nearly  equal  oEt  is  observed  for  MH>  140  (away 
from  closed  shell).  These  results  were  interpreted  in  terms  of  differences  in  the 
dependence  of  Pa  on  the  interfragment  separation  ( d )  for  these  two  mass  regions.  An 
important  correlation  is  the  dependence  of  the  width  oe*L  of  the  LRA-light  fragment 
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Figure  4.  Variation  of  ET  and  aEj,  of  the  total  kinetic  energy  ( ET )  distribution  with  heavy 
fragment  mass  MH  in  binary  and  LRA-accompanied  fission  (Choudhury  et  al  1980). 


angular  correlation  on  the  total  fragment  kinetic  energy  EK  (figure  5).  If  the  variation  of 
with  Ek  is  examined  under  the  two  assumptions  that  the  variation  of  the  final  EK 
can  be  due  either  to  (i)  the  variation  of  the  prescission  kinetic  energy  EKo  or  (ii)  the 
variation  in  the  Coulomb  energy,  the  case  (i)  leads  to  a  larger  o0aL  with  increasing  EK 
whereas  the  case  (ii)  leads  to  a  smaller  ae  with  increasing  EK.  The  observed  oe*L  vs  EK 
favours  the  case  (ii)  implying  that  EKo  does  not  increase  with  EK  which  is  expected  only 
if  EKo  has  a  small  magnitude.  Trajectory  calculations  based  on  three-point  charge 
approximation  have  been  done  to  fit  these  data  to  arrive  at  an  initial  set  of  scission 
parameters  and  their  widths.  From  the  above  studies  it  has  been  concluded  that  the 
fission  fragments  have  a  very  small  prescission  kinetic  energy  and  that  the  alpha  particle 
is  liberated  close  to  either  of  the  two  fragments  and  slightly  off  the  axis  joining  the  two 
fragment  centres.  This  result  has  a  direct  relevance  to  the  nature  of  fission  dynamics  and 
points  to  a  strong  coupling  between  the  collective  and  internal  degrees  of  freedom 
suggesting  applicability  of  the  statistical  models  during  the  saddle  to  scission. 

In  thermal  neutron  induced  fission  of 235  U  a  still  more  rare  mode  of  fission  in  which 
two  light  charged  particles  are  emitted  simultaneously  together  with  the  two  heavier 
fragments  was  first  observed  at  Trombay  (Kapoor  et  al  1972).  The  probability  of  such 
events  was  estimated  to  be  about  one  per  million  binary  events  and  the  gross  features  of 
lcp  energy  spectrum  observed  in  this  case  were  found  to  be  similar  to  those  in  ternary 
fission.  Later  in  spontaneous  fission  of  252Cf  these  lcp  were  identified  as  3H  and 
4He  and  their  relative  yields  were  found  to  be  similar  to  that  in  ternary  fission.  Although 
their  energies  were  not  strongly  correlated  their  angles  of  emission  were  found  to  be 
well  correlated  (Kataria  et  al  1973).  Trajectory  calculations  were  done  using  different 
initial  conditions  and  the  one  which  assumed  that  the  two  lcp  are  emitted  in  the  neck 
region  statistically  independent  of  each  other  at  different  times  near  scission  gave  a 
better  fit  to  the  experimental  data. 
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Figure  5.  Dependence  of  of  the  angular  correlation  function  on  the  total  fragment 
kinetic  energy  for  different  windows  of  lra  energy  (Choudhury  et  al  1980). 


3.  Studies  of  lra  emission  in  fast  neutron  induced  fission 

3.1  Emission  probability  of  lra  and  lcp  in  fission 

As  the  mechanism  of  lcp  emission  in  fission  is  not  well  understood  and  very  little  was 
known  about  the  factors  which  decisively  influence  the  emission  probability  PLCP  one  of 
the  aspects  of  lcp  emission  investigated  in  detail  was  the  dependence  of  PLCP  on  the 
excitation  energy  of  the  fissioning  nucleus,  in  235U  («,/).  In  this  case  the  region  of 
incident  neutron  energy  of  particular  interest  is  thermal  to  about  5  MeV,  because  at 
much  higher  energies  contributions  from  second  and  higher  chance  fissions  complicate 
the  interpretation  of  the  results.  The  results  of  measurements  at  thermal,  2,3  and  4  MeV 
neutrons  showed  no  appreciable  change  in  the  Pa  with  neutron  energy  En  implying  that 
if  Pa  is  dependent  on  the  scission  configuration  or  excitation  energy  at  scission,  these 
latter  quantities  do  not  change  significantly  with  the  initial  excitation  energy  of  the 
fissioning  nucleus  (Nadkarni  and  Kapoor  1970).  In  a  later  work  the  yield  of  different 
lcp,  their  energy  spectra  were  measured  in  235U(n,/)  for  different  values  of  En  using  a 
A E-E  detector  telescope  to  identify  the  lcp  (Sharma  et  al  1981)  using  the  2  MV  Van  de 
Graaff  machine  at  iit  Kanpur.  In  this  study  increase  ~  20  %  in  Pa  for  En  ~  200  keV  was 
observed  compared  to  that  in  thermal  neutron  fission  in  good  agreement  with  an  earlier 
measurement  (Krishnarajulu  et  al  1977).  However,  the  yield  of  tritons  increased  with  En 
such  that  at  En  ~  550  keV,  it  was  about  3  times  higher  than  the  thermal  neutron  value.  A 
still  higher  increase  in  the  yield  of  protons  was  observed.  These  observations  of 
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dependence  of  PLCP  with  En  have  not  yet  found  satisfactory  explanation  and  need 
further  investigations.  Measurements  were  also  carried  out  to  study  the  dependence  of 
the  yield  of  polar  and  equatorial  protons  and  alpha  particles  on  incident  neutron  energy 
(En  ~  600  keV)  where  a  similarity  in  the  dependence  of  emission  probability  on  the 
excitation  energy  of  the  fissioning  nucleus  was  observed  both  for  the  polar  and 
equatorial  lcp  (Sinha  et  al  1980,  1982,  Sharma  et  al  1981). 

3.2  Angular  distribution  of  lra  and  ternary  fission  fragments  with  respect  to  the  incident 
neutron  direction 

The  angular  distribution  of  lra  and  ternary  fission  fragments  with  respect  to  the 
incident  neutrons  was  first  measured  by  Ramanna  et  al  in  14MeV  neutron  fission  of 
238U.  They  observed  a  forward-backward  peaked  angular  distribution  of  lra  and  a 
90°-peaked  distribution  of  ternary  fission  fragments.  These  results  suggested  that  the 
lra  emission  is  analogous  to  the  evaporation  of  particles  from  a  deformed  and  excited 
fissioning  nucleus  prior  to  scission.  The  observed  results  also  implied  that  lra  emission 
is  dependent  on  the  K-quantum  number,  lra  emission  being  less  probable  for  K  =  0 
than  for  the  case  where  K  is  large.  In  further  investigations  the  anisotropy  of  the 
angular  distribution  of  lra  with  respect  to  the  incident  beam  was  measured  in  3MeV 
neutron-induced  fission  of  23  5U  (Hattangadi  et  al  1965)  where  the  anisotropy  was 
found  to  be  in  agreement  with  the  value  calculated  on  the  basis  of  the  statistical  theory 
of  evaporated  particles.  The  observed  lra  anisotropy  was  compared  with  the 
anisotropy  of  the  ternary  fission  fragments  measured  in  3  MeV  neutron  fission  of  23  5U 
and  it  was  found  that  the  magnitude  of  the  observed  ternary  fragment  anisotropy  is 
equal  to  that  expected  on  the  basis  of  the  fact  that  lra  are  emitted  prior  to  scission  at 
right  angles  to  the  fission  fragments  (Nadkarni  1968).  The  question  of  K-dependence  of 
the  lcp  emission  probability  is  also  of  current  interest  in  the  studies  of  lcp  emission  in 
heavy  ion-induced  reactions. 


4.  Study  of  prompt  radiation  emitted  in  lcp  accompanied  fission 

Emission  of  K  x-rays  in  lcp  accompanied  fission  was  investigated  for  the  first  time  at 
Trombay  where  the  yields  of  K  x-rays  emitted  by  fission  fragments  in  lcp  accompanied 
fission  were  compared  with  that  in  binary  fission  in  the  case  of  spontaneous  fission  of 
252Cf.  The  observed  K  x-ray  yields  in  these  two  cases  did  not  seem  to  favour  the 
assumption  of  exclusive  emission  of  lra  from  either  of  the  two  fragment  groups 
(Kapoor  et  al  1968).  If  the  emission  of  lra  takes  place  after  scission  at  the  expense  of 
nucleons  only  from  some  fragments  the  isotopic  yield  distribution  of  the  fragment  of  a 
given  charge  would  vary  in  a  definite  manner  as  compared  to  that  in  binary  fission.  With 
this  in  view  the  isotopic  yield  distribution  for  fragments  of  specified  nuclear  charge  in 
lra  fission  relative  to  that  in  binary  fission  was  measured  in  spontaneous  fission  of 
252Cf  using  the  method  of  discrete  y-ray  lines  emitted  from  fragments.  A  number  of  y- 
ray  lines  were  assigned  to  specific  fission  fragments  and  the  ratios  of  intensities  of  these 
y-ray  lines  in  lcp  fission  and  binary  fission  were  determined.  Taking  2+  -»  0+  transition 
intensities  of  the  even-even  nuclei  to  be  a  measure  of  their  yields,  the  yields  of  several 
individual  fission  fragments  of  known  mass  and  charge  in  lcp  accompanied  fission 
relative  to  that  in  binary  fission  were  obtained  (Ajitanand  et  al  1975).  Figure  6 
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Figure  6.  Isotopic  mass  distributions  for  different  fission  fragment  charges  in  lra 
accompanied  fission  compared  to  those  in  binary  fission  (Ajitanand  et  al  1975). 


shows  the  isotopic  yield  distributions  for  different  fragment  charges 
(Z  =  40-60)  in  lcp  accompanied  fission  derived  from  this  experiment  and  these  are 
compared  with  those  in  binary  fission.  The  isotopic  mass  distributions  were  observed  to 
be  altered  in  the  light  and  heavy  fragment  groups  almost  to  the  same  extent.  If  lcp  were 
emitted  exclusively  from  either  of  the  fragment  groups  (light  or  heavy),  substantially 
different  shifts  in  the  isotopic  yield  distribution  would  be  expected.  Thus  these  results 
also  supported  the  view  that  lra  are  emitted  at  the  expense  of  nucleons  from  both  the 
light  and  heavy  fragment  groups  or  from  the  fissioning  nucleus  as  a  whole. 

To  sum  up,  the  investigations  of  lcp  emission  in  fission  briefly  reviewed  above,  have 
provided  information  on  the  various  facets  pertaining  to  the  mechanism  of  emission  of 
these  particles  and  have  also  proved  to  be  useful  in  exploring  the  dynamics  of  the  last 
stages  of  the  fission  process. 
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Development  of  thermal  reactor  lattice  physics  methods 
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Abstract.  This  paper  describes  the  reactor  physics  methods  developed  for  thermal  reactor 
lattices  in  India.  The  formulation  of  the  models  introduced  is  based  on  energy-dependent 
integral  neutron  transport  theory.  More  emphasis  was  put  on  the  development  of  lattice  cell 
calculational  methods  which  indeed  form  the  basis  of  physics  design  of  nuclear  reactors.  The 
physical  formulation  and  the  cross-sections  used  were  subjected  to  comprehensive  validation 
tests  through  analyses  of  experimental  information.  The  comparison  of  computed  and 
measured  parameters  have  amply  brought  out  the  soundness  of  the  physical  formalism  and  the 
cross-sections  used  in  our  calculational  procedures. 

Keywords.  Neutron  transport  equation;  collision  probability  methods;  lattice  cells:  multi¬ 
group  cross-sections;  light  water  lattices;  heavy  water  lattices;  rod-clustered  lattices. 

PACS  No.  28  40 


1.  Introduction 

Evaluation  of  neutron  economy  is  the  basic  problem  in  establishing  the  feasibility  of  a 
self-sustaining  chain  reaction  in  a  multiplying  medium.  The  basic  question  to  be 
answered  in  this  context  is,  “how  many  neutrons  are  emitted  in  fissions  and  what 
fraction  of  these  can  be  expected  to  induce  further  fissions  without  being  lost  in  some 
other  reactions”.  The  answer  will  essentially  involve  detailed  accounting  of  various 
mutually  competing  neutron  events  depending  upon  the  material  composition  and 
geometrical  configuration  of  the  system. 

The  most  obvious  way  of  solving  this  problem  is  the  direct  numerical  integration  of 
the  energy-dependent  neutron  transport  equation  for  the  system  under  study.  But  the 
number  of  variables  involved  coupled  with  the  accuracy  requirement  make  this  task  an 
impracticable  one  even  on  most  powerful  computers.  This  naturally  leads  one  to  seek 
simplifications.  A  number  of  techniques  have  thus  been  developed  since  about  1940. 
Better  understanding  and  accurate  prediction  of  reactor  behaviour  were  always 
stressed  upon  in  the  physical  models  evolved.  The  three  well-established  ways  of 
describing  neutron  events  in  a  thermal  reactor  are:  (i)  descriptive  set  of  formulae — the 
celebrated  four  factor  formula  with  one  or  two  group  leakage  model;  (ii)  few  group 
diffusion  or  transport  theory  methods;  and  (iii)  detailed  description  of  all  neutron 
processes  by  multi-group  transport  theory. 

In  this  paper,  we  will  be  concentrating  mainly  on  the  third  approach  and  discuss  the 
methods  evolved  by  us  in  this  area.  After  some  generalities  on  the  integral  transport 
theory,  we  will  return  to  the  methods  developed  by  us  in  this  subject. 

In  the  transport  theory  approach,  the  reactor  behaviour  is  predicted  through  a 
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detailed  assessment  of  various  physical  processes  involving  neutrons.  This  entails  an 
accurate  knowledge  of  neutron  flux  distribution  as  a  function  of  space  and  energy. 
Multigroup  transport  theory  methods  offer  an  immediate  solution  because,  both  in 
principle  and  in  practice,  these  methods  are  capable  of  giving  a  high  degree  of  accuracy 
and  reliability. 

The  energy  variable  of  neutron  flux  distribution  is  discretised  in  the  multigroup 
approach,  that  is,  the  entire  energy  range  is  divided  into  a  finite  number  of  discrete 
energy  groups.  The  selection  of  the  number  of  groups  and  their  boundaries  may  be 
decided  by  the  energy  behaviour  of  the  important  cross-sections  and  also  by  the 
operating  energy  range  of  the  system  under  consideration.  For  instance,  events  in  the 
thermal  energy  range  play  an  important  role  in  thermal  reactors  and  so  more  weigh tage 
may  be  given  to  this  energy  region. 

The  group  cross-sections  are  obtained  by  averaging  the  basic  nuclear  data  with 
appropriate  weighting  spectrum.  This  introduces  certain  arbitrariness  into  the  group 
cross-sections  because  both  weighting  flux  and  the  group  constants  are  inter¬ 
dependent.  Multigroup  structure  and  the  associated  cross-sections  adopted  in  our 
model  will  be  discussed  in  §4.  The  last  section  presents  the  results  of  a  comprehensive 
evaluation  of  our  model  through  analyses  of  a  wide  base  of  experimental  information. 


2.  Integral  transport  equation 

There  are  two  alternatives  for  treating  the  spatial  variable  of  the  transport  equation. 
The  integro-differential  ( Pn  and  Sn)  methods  and  integral  transport  theory  (Pi;  and  J  ± ) 
methods  (Huria  1976).  The  former,  though  mathematically  more  sound,  are  rather 
time-consuming  and  are  thus  not  suitable  for  repeated  calculations.  They  are,  in  fact, 
more  useful  for  obtaining  an  overall  reactor  behaviour. 

By  contrast,  collision  probability  methods  based  on  integral  transport  theory  have 
proved  to  be  very  effective  in  solving  even  the  complex  power  reactor  lattice  cell 
problems  at  relatively  low  computational  costs.  They  are  especially  suitable  for 
computing  cell  flux  distribution.  We  will  now  devote  some  time  to  the  integral  transport 
theory  and  its  application  to  lattice  cell  calculations. 

The  time-independent  transport  equation  with  isotropic  scattering  may  be  written  in 
the  form  (Williams  1966). 

2,(r,  £)®(r,  E) 

=  dr'js(r',E)  + 

J  v 

where  </>(r,  E)  is  the  angle-integrated  neutron  flux  and 


”  Xs(r',  E'  -  E)4>(t\  £')d£'|/>(r'  -» r,  E),  (1) 


P(r' 


r,  £)dr 


Z«(r,  E) 

4n  |  r'  -  r  | 2 


fK 


exp 
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I ,(s,  £)ds  jdr 


}. 


is  the  probability  that  a  neutron  of  energy  E  originating  at  the  point  r'  will  make  its  next 
collision  in  the  volume  element  dr  at  r.  In  terms  of  the  transport  kernel  T  (r'  —  r,  E) 
defined  by 
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exp< 

nr'-ri  ) 

2,(s,£)dsJ 

4n 

r'  -  r 

2 

T  (r'  ->  r,  £) 

P(r'  -►  r,  E )  may  be  written  as 

P  (r'  r,  £)dr  =  L,  (r\  E)T  (r'  r,  £)dr, 


(3) 

(4) 


T  is  the  probability  that  a  neutron  of  energy  E  originating  at  r'  will  reach  r  without 
making  any  collision  in  between. 

S  (r,  E)  is  the  number  of  neutrons  born  per  unit  volume  due  to  sources.  Zs(r,  E' 
-►  £)d£  is  the  cross-section  that  transfers  neutrons  of  energy  E '  to  the  energy  interval 
d£  around  £  at  the  point  r.  Other  symbols  have  their  usual  meaning. 

The  transport  equation  can  be  derived  directly  using  the  definitions  of  the  transport 
kernel  T  and  energy  transfer  cross-section  Zs(r,  £'  -»  £)  and  considering  the  neutron 
conservation. 

Let  us  now  specialise  the  transport  equation  to  the  cell  problem.  The  lattice  cell  under 
consideration  is  divided  into  certain  number  of  (say  N)  regions  each  with  constant 
cross-sections.  The  equation  then  becomes  (for  the  flux  in  the  ith  region) 

<Mr,  E) 


=  i  f  dKT(r' 

j  =  1  J  Vj 


r,  £)[Sj(r',  £)  + 


d  E'Z„(E' 


E)<t>j( r',  £')] 


where  the  indices  t  and  j  refer  to  individual  regions  related  to  r  and  r',  respectively. 

From  here  onwards,  we  will  talk  only  about  the  collision  probabilities  and  not 
transport  kernel.  In  terms  of  collision  probabilities,  the  above  equation  becomes 


I„<Mr,  E) 

=  £  |dr'P(r'-*r,£)[SJ(r',£)+  P°d£TSJ ,(£'-*£)  ^(r',F)].  (6) 

j= ij  Jo 


The  average  flux  in  region  i  is  defined  by 

4>i(E)  =  ~  I  dr0,(r ,£).  (7) 

Vi  J  V, 

The  assumption  of  constancy  of  cross-sections  in  each  spatial  region  has  enabled  us  to 
replace  the  complete  integral  by  the  sum  of  a  series  of  integrals.  If  we  are  interested  in 
the  total  number  of  collisions  at  energy  £  in  a  region  i ,  then  it  will  be  necessary  to 
integrate  (6)  over  all  points  r  in  region  i  giving 

£„(£)<£,  (£)£,• 


N 


=  I  £,,(£)  Kj[S,(£)  + 
;=  i 


°°  Esj(E'  -*  £)<M£')d£'] 
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where 


Pji(E)  = 


t *  r 

dr 

-  J 


<t>j( r',  £)P(r'-r,£)dr' 


£) dr' 


(9) 


168 


B  P  Rastogi  and  H  C  Huria 


Pji  is  the  probability  that  a  neutron  born  at  energy  E  through  a  collision  in  region  j  will 
make  its  next  collision  in  region  i. 

Equation  (8)  represents  a  set  of  N  coupled  integral  equations  for  the  average  flux  at 
energy  E  in  each  region  of  the  system.  No  approximations  have  been  made  so  far  and 
the  equation  in  this  form  is  exact  and  rather  general. 

The  solution  of  this  set  of  equations  evidently  requires  the  specification  of  collision 
probabilities  PtJ(£).  These  are  dependent  on  the  flux  distribution  as  is  clear  from  their 
defining  equation  (9).  The  most  widely  used  assumption  is  that  of  flat  flux  in  each  region 
at  all  energies.  This  enables  the  collision  probabilities  to  be  calculated  rather  easily. 
With  the  flat  flux  (and  source)  approximation,  the  set  of  equations  (8)  reduces  to 


I  AEWiWVi 


N 


=  I  PJl(E)[S,(E)  + 

;  =  l 


”zsJ(E'^  £)<£,.  (E')dE'] 
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with  collision  probabilities  given  by  (c.f.  (2)) 


Pji(E)  = 


y  •  ^ 

^ ti 


4nVj 


v, 


exp  - 
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r  — r 


(10) 


(11) 


The  reciprocity  theorem  which  states  that  the  flux  at  r'  due  to  a  source  at  r  is  equal  to 
the  flux  at  r  due  to  a  source  at  r'  leads  to  the  following  reciprocity  relation  for  the 
collision  probabilities: 


^ti(E)ViPij(E)  =  'Ltj(E)VjPji(E\ 


and  finally,  if  the  system  is  effectively  infinite  in  extent,  then  we  must  have 


(12) 


PijiE)  =  1  for  all  i , 


(13) 


because  a  neutron  born  in  region  i  must  make  a  collision  either  in  the  same  region  or  in 
any  of  the  other  regions  of  the  system. 

Thus,  the  equation  for  mean  reaction  rate  in  region  k  reduces  to 


fyk^tk  ^k 


N 


I  P*(E)[S,{E)  + 


i  =  1 


■00 


Za(E' -£)*,(£')<!£']. 


0 


(14) 


One  can  rewrite  the  neutron  balance  equation  for  the  average  reaction  rate  in  region 
k  in  terms  of  volume  and  surface  sources.  In  the  frame  of  multiregion  picture,  the 
surface  source  integral  is  expressed  in  terms  of  partial  surface  integrals,  and  the  volume 
source  in  subregion  integrals  (Emendorfer  1973).  Assuming  that  all  the  partial  currents 
have  the  same  angular  distribution  at  each  surface,  each  region  gets  coupled  to  its 
adjacent  ones  through  incoming  currents.  The  balance  equations  could  thus  be  written 
as: 


<t>k(E)Z,k.Vk  =  VtQtP*v  +  P*yj;-l+P*yJi 


j k  =  VkQkPl‘yo  +  Jk-lPl‘o  +  JkPk 


iO 


00 » 


(15) 

(16) 


where  PkXY  are  the  first  flight  collision  probabilities  with  x  and  y  indicating  the  origin 
and  exit  processes  respectively.  For  example,  Pkyy  IS  the  probability  that  a  neutron  born  in 
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region  k  will  make  its  next  collision  in  the  same  region;  P*0)K  is  the  probability  that  a 
neutron  entering  region  k  from  its  inner  (outer)  surface  will  collide  within  k.  are  the 
total  outward  (4-)  and  inward  (  — )  currents  at  the  outer  surface  of  region,  k .  Qk  is  the 
total  source  which  includes  both  fission  and  scattering  sources,  that  is, 


ft  =  $,(£)  + 


00 


0 


Is,.(£'-+E)<M£')d£' 


(17) 


The  first  form  of  the  balance  equation  [equation  (14)]  is  known  as  the  concept. 
Herein  each  region  is  coupled  to  all  the  regions  of  the  system  by  first  flight  collision 
probabilities.  The  second  form  [equations  (15)  and  (16)]  is  known  as  the  J  ±  (interface 
currents)  concept  where  each  region  is  coupled  with  its  adjacent  regions  by  surface 
currents.  Both  the  concepts  are  equivalent  as  far  as  the  correct  space  and  angular- 
distributions  for  source  and  current  densities  are  involved.  But  the  latter  form  is  simpler 
to  handle  from  the  viewpoint  of  computer  storage  and  time  requirements.  The 
additional  assumption  that  all  the  partial  currents  have  the  same  angular  distribution, 
which  helped  to  realise  this  simplicity,  will  definitely  entail  a  loss  in  accuracy.  However, 
the  advantage  accruing  from  the  savings  in  computational  cost  far  outweighs  the 
marginal  loss  in  accuracy. 

In  the  multigroup  form,  the  two  sets  of  equations  may  be  written  as 


L<K^  =  I  VjQ'jP'jt, 


tk 


with 


g  9  ~*g 
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i*i  v* = vkQin,vv+Jt-iPiv+Jvn 

tk 
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(18) 

(19) 

(20) 
(21) 


where  the  superscript  g  indicates  the  energy  group  and  index  k  denotes  the  spatial 
region  k. 


3.  Multigroup  transport  theory  methods  developed  in  India 

Let  us  now  review  our  work  in  connection  with  the  development  of  multigroup 
methods  for  thermal  reactors  (Huria  1976).  For  obvious  reasons  we  selected  the  J1 
concept — commonly  called  the  interface  current  approach — for  the  lattice  cell 
calculations  of  thermal  reactors,  both  light  water  and  heavy  water  moderated.  The 
lattice  cell  calculations  in  fact  form  the  basis  of  physics  design  of  nuclear  reactors 
because  they  provide  the  necessary  input  for  global  reactor  calculations.  Moreover, 
they  have  independent  importance  as  well — they  provide  information  on  the  fuel 
element,  reaction  rate  and  power,  the  isotopic  depletion  and  build-up,  and  for  the 
interpretation  of  post-irradiation  burn-up  measurements. 

3.1  Cell  calculation 

The  physical  formalism  of  our  lattice  physics  code  murli  (Huria  1978,  1981)  is  briefly 
described  below.  The  balance  equation  for  average  reaction  rate  in  region  k  bounded  by 
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surfaces  k  —  1  and  k  may  be  written  as  (dropping  the  group  index  for  the  sake  of 
simplicity). 

X  </>*  Vk  =  QkPr  +  Jk+- 1  Pkv+Jk  Pov  (22) 

tk 

where  Qk  is  the  total  source,  that  is,  the  number  of  neutrons  (of  energy  group  g)  born  in 
region  k\j£  is  the  total  outward  (superscript  +)  and  inward  (superscript  — )  at  the  kth 
surface;  PIkv(P$v)  is  the  probability  that  a  neutron  entering  region  k  from  its  inner 
(outer)  surface  will  make  its  next  collision  within  the  region  k;  and  Pky  is  the  probability 
that  a  neutron  born  uniformly  and  isotropically  in  region  k  will  collide  therein  again. 

The  above  equation  must  be  supplemented  by  the  conservation  equations  for 
neutron  currents  at  the  two  surfaces.  Equations  for  the  outgoing  currents  are 

Jk  =  Pk°Qk  +  PS°Jk  +  PL°Jk- 1 ,  (23) 

Jt-^PPQk  +  Pg'Jt-  (24) 

Physical  interpretation  of  these  equations  is  quite  obvious.  The  source  term  is  given  by 


G  g'^9  1  G  g' 

Ltf'  =  l  Sk  /V=1  fk 


(25) 


where  X  is  the  eigenvalue. 

If  we  separate  the  source  term  into  a  self-scattering  part  and  the  remaining 
contribution,  Qgk  may  be  written  as 

er«fIf <ttVk+Qkg,  (26) 

Sk 


with 
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The  equation  for  the  collision  rate  then  becomes 
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ittvA  1  -~-rr)  =  Ql9Pr+P2yJk-+PLyJ?- 1  (28) 

“  v  z  ; 

tk 

Let  us  now  define 

g~*g  lg 

Tk  —  Yj  /X  (number  of  secondaries  per  collision),  (29) 

Sk  /  tk 

wr=pnv-Tkpn,  m 

Wkv=PLv/V-TkPr\  (31) 

W2v=P!v/(l-TkPD-  (32) 
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With  the  introduction  of  Wky"s  (28)  reduces  to 

Vk  =  WQ*  +  WJk  +  i  ■  (33) 

Ik 

One  can  similarly  derive  the  conservation  equations  for  currents.  These  are,  finally 

Jk  =  WZ°Q’k  +  MVS°Jk  +  M°K- 1 ,  (34) 

Jk- 1  =  K'Q'k  +  WPJi  +  Wl’Jt- . ,  (35) 

with 

WP>~P[°/(i-TtPn,  (36) 

WZ<  =  PPK 1  -TkPn,  (37) 

w?°  =  n° + nyTk  prn  i  -  rk  pd,  (38) 

w?  =  P£l  +  PZvTkP?/(  1  -Tkpr\  (39) 

^  =  PL0  +  PLvTkP[°/(  1  -7i?r),  (40) 

Wli  =  Pl'TkP{I/(l-TkPn.  (41) 


The  are  related  to  the  first  flight  collision  probabilities  as  shown  above.  They  can 
also  be  interpreted  as  multiple  collision  and  escape  probabilities  as  discussed  by 
Driggers  (1964).  The  superscripts  x  and  y  denote  the  origin  and  exit  processes  which 
may  be  (i)  collision  event  [  V],  (ii)  entering  or  leaving  through  inner  surface  [/],  and 
(iii)  entering  or  leaving  through  outer  surface  [0].  For  instance  W$v  represents  the 
number  of  collisions  caused  in  regions  k  by  one  neutron  entering  from  its  outer  surface. 
Similarly,  W%1  is  the  number  of  neutrons  leaving  region  k  from  its  inner  surface  per 
neutron  that  enters  it  through  its  outer  surface.  The  neutrons  leaving  result  either  from 
no,  one  or  more  collisions  that  the  incoming  neutron  might  undergo  in  the  region. 

It  is  seen  that  the  cosine  current  approximation  leads  to  balance  equations  of  block 
diagonal  form,  each  block  representing  a  2  x  2  matrix  of  incoming  and  outgoing 
currents  at  each  surface.  This  feature  results  in  substantial  reductions  in  computer 
storage  requirements. 

These  equations  could  be  solved  by  appropriate  matrix  manipulations.  However,  a 
slightly  different  technique  is  adopted  in  the  present  model  (Driggers  1964). 

The  currents  at  the  kth  surface  are  expressed  in  terms  of  those  at  the  (k  —  1)  is  as 
follows: 

Jk  =M+-i+4  (42) 

Jk  =Dk  +  1Jk++Hk+l.  (43) 

Substitution  of  these  into  the  current  conservation  equations  [(34)  and  (35)]  leads  to 
recurrence  relations  for  the  constants  Ak ,  Bk ,  Dk  and  Hk.  With  a  specified  boundary 
condition  (say  at  the  outermost  region  N  of  the  cell)  like  J~  —  or  J~  =  constant, 
all  the  relational  constants  could  be  evaluated. 

Now,  the  only  conservation  equation  for  the  centremost  region  is 

j  r  ^wrQi+wrj;,  m 

which  could  be  reduced  to 

Jt  =(W01°H2  +  W\°Ql)l(\-D2wr\ 

J~  =D2J?  +tf2. 


(45) 

(46) 
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With  the  starting  currents  given  by  (45)  and  (46),  the  currents  at  the  other  surfaces  can 
be  easily  derived  using  relations  [(42)  and  (43)]. 

The  first  flight  collision  probabilities  as  desired  for  the  evaluation  of  Wjp's  are 
calculated  from  the  expressions  proposed  by  Kennedy  (1964). 


3.2  Leakage  calculation 

The  cell  calculations  give  directly  the  infinite  multiplication  factor  Kx.  The  effective 
multiplication  factor  Keff  is  obtained  by  representing  the  leakage  normally  through 
experimental  buckling.  This  facilitates  the  leakage  calculation  to  be  done  by  point 
model,  that  is,  zero-dimensional  multigroup  diffusion  theory  with  cell  homogenized 
cross-sections.  This  procedure  was  verified  by  performing  multigroup  one-dimensional 
diffusion  theory  calculations  for  some  representative  cases.  This  latter  procedure, 
however,  becomes  desirable  for  relatively  small  systems  having  large  neutron  leakages. 

Besides  the  limitations  of  the  point  model,  two  other  effects  that  have  bearing  on 
leakage  treatment  are  the  scattering  anisotropy  and  heterogeneity  correction.  The 
present  model  does  not  account  for  the  former  explicitly  except  for  the  use  of  transport 
corrected  total  cross-sections.  The  latter  effect  for  the  regular  light  water  lattices  is  not 
likely  to  be  significant,  because  of  the  fact  that  the  transport  cross-sections  of  light 
water  and  U02  are  not  too  different. 

3.3.  Burn-up  calculations 

Burn-up  prediction  is  another  important  aspect  of  physics  design  of  reactors.  It 
supplies  information  on  isotopic  changes  and  their  effect  on  reactivity,  power 
distribution,  kinetic  characteristics,  etc.  The  main  outcome  of  the  lattice  cell  burn-up 
calculations  is  the  energy  and  space-averaged  constants  at  desired  exposures  as 
required  for  the  global  reactor  calculations.  This  will  obviously  involve  the  recompu¬ 
tation  of  lattice  cell  spectrum  at  the  specified  burn-up  values. 

The  isotopic  build-up  and  depletion  equations  of  fissile,  fertile  and  ‘futile’  (fission 
product)  nuclei  are  written  as 

=  \®Xj?ij0fjNj  +  ®0'iNi-l+XiN'l] 

+  j 

where  Nt  =  number  density  of  the  ith  type  of  nuclei;  yi;-  =  fission  yield  of  the  ith  nuclei 
from  the  fission  of  j th  nuclei;  dc  —  average  capture  cross-section;  of  —  average  fission 
cross-section;  oa  =  ac-\-of  —  average  absorption  cross-section;  X  —  decay  constant  and 
<D  =  average  flux. 

The  terms  within  the  first  pair  of  square  brackets  are  rates  of  production  of  ith  nuclei 
from  (i)  fission,  (ii)  neutron  capture  and  (iii)  decay  from  other  nuclei,  respectively.  The 
terms  within  the  second  pair  are  the  losses  by  (i)  neutron  absorption,  and  (ii)  decay  to 
other  nuclei. 

These  equations  are  in  reality  non-linear,  because  the  flux,  iVf’s  and  a? s  are  time- 
dependent.  Assuming  the  neutron  flux  to  be  constant  over  the  given  time  interval,  the 
coefficients  of  iV,  become  constant  and  the  system  of  equations  becomes  linear  within 
that  time  interval.  This  makes  the  integration  completely  straightforward.  Thus  the 
generation  of  burn-up  dependent  parameters  will  imply  periodic  recomputation  of  the 
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coefficients  of  the  aforesaid  equations  using  the  up-dated  neutron  flux  corresponding 
to  a  new  isotopic  composition. 

The  total  number  of  burnable  (plus  fission  product  nuclei)  in  our  burn-up  model  is  1 8 
which  includes  4  pseudofission  products  emanating  from  the  chains  of  uranium  and 
plutonium,  rhodium,  xenon  and  a  lumped  fission  product.  The  remaining  are  fissile  and 
fertile  nuclei.  These  equations  are  optionally  solved  by  either  5th  order  Runge-Kutta 
method  or  by  trapezoidal  rule. 

4.  Multigroup  structure  and  cross-sections 

Energy  variation  of  the  neutron  spectrum  is  calculated  by  the  standard  multigroup 
procedure.  The  group  structure  and  the  cross-sections  of  the  wims  (Askew  1966)  code 
have  been  adapated  in  our  calculational  model.  The  main  aim  in  selecting  the  group 
boundaries  of  wims  has  been  to  provide  enough  groups  so  that  the  group  constants  may 
be  considered  effectively  independent  of  the  weighting  spectra  used  to  generate  them. 

The  entire  energy  range  is  divided  into  69  groups.  The  fast  range  defined  to  lie 
between  10  MeV  and  9-1 18  keV  is  divided  into  14  groups  of  equal  lethargy  width.  The 
limits  of  13  resonance  groups  (9T 1 8  keV-4  eV)  have  been  selected  to  give  adequate 
representation  to  the  locations  of  important  resonances  of  main  fissile  and  fertile 
isotopes.  Forty  two  groups  describe  the  thermal  range  below  4eV.  Here  the  group 
boundaries  have  been  chosen  to  cluster  around  the  thermal  energy  resonances  of 
plutonium  isotopes  and  around  the  peak  of  thermal  spectrum. 

The  basic  cross-sections  have  been  derived  from  uk  Nuclear  Data  Files  and  then 
processed  into  multigroup  constant  through  galaxy  (Bell  1964)  code.  The  weighting 
spectrum  for  producing  fast  cross-sections  is  that  typical  of  an  undermoderated  light 
water  system.  The  cross-sections  have  been  checked  to  reproduce  measured  age  in  the 
principal  moderators. 

The  cross-sections  in  the  resonance  range  are  not  only  spectrum-dependent  but  also 
change  with  temperature  and  composition  of  the  system.  The  library  contains 
tabulations  of  group  resonance  integrals  for  234U,  235U,  236U,  238U  and  239Pu  in  terms 
of  potential  scattering  and  temperature.  These  have  been  derived  from  the  numerical 
integration  of  the  slowing  down  equation  in  the  homogeneous  media  of  the  absorber 
and  a  moderator  at  different  temperatures  through  the  sdr  (Brissenden  and  Durston 
1965)  code.  For  actual  heterogeneous  cell  calculations,  the  effective  potential  scattering 
cross-section  is  obtained  through  “equivalence  theorems”  according  to  the  prescrip¬ 
tions  of  WIMS. 

Thermal  scattering  matrices  derived  from  various  scattering  kernels  for  the  usual 
moderating  materials  have  been  tabulated  at  typical  reactor  temperatures.  All  other 
materials  have  gas  law  scattering  matrices.  The  weighting  spectra  used  to  obtain  other 
cross-sections  are  Maxwellian  with  \/E  tails.  These  have  been  generated  by  the  code 
pixse. 

The  wims  cross-sections  (and  procedures)  have  been  subjected  to  systematic  checks 
through  comparisons  with  measured  parameters  and  results  from  sophisticated 
Monte-Carlo  calculations  (Fayers  1967;  Kemshell  1972;  Chawla  1972). 

Apart  from  the  69-group  library,  condensed  28  and  27  group  libraries  have  been 
generated  using  typical  operating  bwr  and  hwr  spectra  for  the  routine  calculations  for 
light  water  and  heavy  water  power  reactors.  The  condensed  libraries  use  the  69-group 
library  as  the  basic  cross-sections. 
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An  exercise  to  generate  resonance  group  cross-sections  for  23 5 U,  238U  and  239Pu  has 
also  been  carried  out  using  a  simplified  procedure  of  integrating  the  slowing  down 
equation  by  multigroup  technique.  The  fine  group  cross-sections  were  prepared  using 
the  data  from  endf/b-iv.  The  group  resonance  integrals  calculated  in  this  way 
agreed  very  well  with  those  of  wims  library.  The  resonance  cross-sections  for  232Th  and 
233U  were  generated  by  this  procedure  using  the  basic  data  from  endf/b-iv.  These 
have  been  incorporated  into  the  library.  Moreover,  the  cross-sections  of  main  fissile  and 
fertile  materials — 235U,  238U,  239Pu,  240Pu,  241Pu  have  also  been  generated  and 
compared  with  the  original  library. 

5.  Evaluation 

The  general  objective  in  the  development  of  murli  code  was  to  acquire  an  efficient  and 
reliable  calculational  tool  for  power  reactor  design  studies.  The  adequacy  and  reliability 
of  a  calculational  model  could  be  established  by  analysing  a  wide  variety  of  relevant 
experimental  results.  In  this  context,  the  measurements  in  clean  uniform  lattices 
provide  a  sound  base  for  evaluating  a  code.  The  model  should  be  able  to  predict 
accurately  the  properties  of  simple  uniform  systems.  If  it  cannot  do  so,  it  is  unlikely  to 
give  reliable  results  for  the  operating  power  reactors,  which,  in  reality,  have  complex 
lattice  structure. 

Information  for  a  large  number  of  critical  and  exponential  experiments  was  collected 
for  investigating  the  validity  of  the  physical  model  of  murli  and  its  associated  cross- 
sections.  The  list  includes  not  only  23 5 U  enriched  systems,  but  mixed  oxide 
(Pu02-U02)  and  (232Th-233U)  oxide  experiments  as  well.  Besides  the  recommended 
benchmarks  (csewg  1978),  the  selected  experiments  cover  a  broad  range  of  parameters 
of  importance  in  physics  design  aspects  like  moderator-to-fuel  volume  ratio,  fuel  pellet 
size,  fuel  enrichment  (and  isotopic  composition),  and  buckling,  etc. 

We  will  begin  with  the  analyses  of  light  water  moderated  systems.  The  investigations 
include  not  only  the  calculation  of  reactivity  but  also  the  comparison  of  measured  and 
calculated  reaction  rate  ratios  (wherever  available).  The  following  two  sections  present 
the  results  of  extensive  validation  tests  carried  out  for  the  selected  experimental  data  on 
critical  and  exponential  experiments  moderated  by  light  water. 

Part  I:  Light  water  lattices 

5.1  The  effective  multiplication  factor.  The  calculation  of  reactivity  is  the  most 
fundamental  problem  of  all  reactor  design  studies.  The  calculational  model  should  be 
able  to  predict  accurately  this  important  characteristic  of  the  reactor  system.  All  the 
other  parameters  like  reactivity  coefficients,  control  rod  worth  fuel  depletion  effects, 
etc.  are  translated  in  terms  of  changes  in  the  multiplication  factor,  murli  was  used  to 
calculate  the  effective  multiplication  factor  for  a  large  number  of  clean  uniform  lattice 
experiments,  the  results  of  which  are  discussed  below  (Huria  1981). 

(i)  U -metal  lattices:  44  bnl  (Price  1966)  exponential  experiments  using  enriched 
uranium  metal  rods  were  analysed  by  murli.  The  enrichment  varied  from  1  to  1-3  w  % 
and  moderator-to-fuel-volume  ratio  from  1  to  4.  The  mean  and  the  standard  deviations 
of  predicted  were  0-9981  and  ±0-00477. 

(ii)  U 02  experiments :  55  enriched  U02-rod  fuelled,  light  water  moderated  critical  and 
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exponential  experiments  (Bernocchi  1977)  with  a  comparatively  wider  range  of 
enrichment  (1*3  to  5*7  w  %)  and  H20  to  U02  volume  ratio  (1  to  5)  were  analysed.  The 
mean  and  the  standard  deviations  of  the  computed  eigenvalues  for  all  the  cases 
considered  together  were  0-9974  and  ±  0-00578. 

(iii)  Mixed  oxide  experiments ;  The  plutonium  systems  are  theoretically  more  difficult 
to  calculate  than  the  uranium  ones.  This  is  due  to  the  complex  cross-section  behaviour 
of  plutonium  isotopes.  In  particular,  239Pu  and  240Pu  have  large  thermal  energy 
resonances  leading  to  relatively  more  significant  thermal  heterogeneities.  Moreover, 
the  fission  cross-section  of  239Pu  has  a  non  1  /v  behaviour.  Another  inherent  (and  rather 
undesirable)  characteristic  of  nearly  all  the  plutonium  lattice  experiments  is  their  small 
size  and  consequently  a  larger  fraction  of  neutrons  leaking  out.  Thus,  the  predictive 
capability  of  a  calculational  model  for  mixed  oxide  system  will  present  a  very  severe  test 
for  its  formulation  and  the  cross-sections. 

With  the  above  object  in  view  we  selected  a  set  of  53  experiments  covering  wide 
spectrum  of  relevant  physics  design  parameters.  The  discrepancies  in  the  calculated 
keff’s  were  within  ±  6  mk  (06  %  of  unity). 

(iv)  (233U02-232Th02)  lattices :  There  are  not  many  experiments  using  (232Th-233U) 
bearing  fuel.  However,  the  bnl  (Windsor  1970)  exponential  experiments  with  (3% 
233U02-97%  233Th02)  fuel  have  been  frequently  used  as  experimental  benchmarks 
for  evaluating  both  the  computational  techniques  and  the  data.  We  have  also  used  the 
same  experiments  for  testing  the  adequacy  of  murli  for  predicting  the  parameters  of 
(233U-232xh)  oxide  bearing  fuels.  The  murli  calculations  for  kefr  are  quite  satisfactory 
and  are  in  reasonable  agreement  with  results  from  wims  and  Monte-Carlo. 

5.2  Comparison  of  reaction  rate  ratios.  The  global  situation  represented  by  the 
effective  multiplication  factor  could  be  satisfactorily  and  reasonably  predicted  for  a 
wide  variety  of  uniform  critical  and  exponential  experiments.  The  performance  of  the 
theoretical  treatments  for  various  neutronic  events  separately  could  be  better  assessed 
from  the  comparison  of  calculated  and  measured  reaction  rates.  The  recommended 
benchmark  experiments  could  be  best  utilised  for  this  purpose. 

We  have  also  compared  relative  reaction  rates  with  the  corresponding  measured 
values  for  a  number  of  experiments  including  the  lattices  recommended  by  the  Cross 
Section  Evaluation  Working  Group.  The  results  were  satisfactorily  consistent  and 
within  experimental  uncertainties  (Huria  1981). 

Part  II:  Heavy  water  lattices 

5.3  The  effective  multiplication  factor.  For  the  evaluation  of  murli  for  predicting  the 
physics  characteristics  of  heavy  water  moderated  lattices,  we  have  selected  the 
compilation  of  relevant  data  given  by  Honeck  and  Crandal  (1964).  This  set  includes 
only  natural  uranium  metal  single  rod  lattices.  There  is  a  very  wide  range  of  rod  radius, 
lattice  pitch,  buckling  and  D20  purity  covered  by  these  lattice  experiments.  The  present 
analysis  includes  only  reactivity  predictions  for  these  lattices.  As  regards  the  relative 
reaction  rates,  the  benchmarks  recommended  by  Cross-Section  Evaluation  Working 
Group  were  considered,  the  results  of  which  will  be  discussed  in  §5.4. 

Besides  the  single  rod  lattices,  rod  clustered  systems  were  also  considered.  The 
calculational  model  had  to  be  modified  to  include  the  appropriate  heterogeneity  effects 
of  the  cluster.  We  will  come  to  this  aspect  in  §5.5. 
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Natural  U -metal  rod  lattices :  The  mean  and  standard  deviations  of  the  computed 
eigenvalue  (Kefr)  for  the  90  single  rod  lattices  are  0-99927  and  0-003666  respectively. 
This  indicates  that  the  formulation  and  the  associated  cross-sections  are  quite  adequate 
in  predicting  the  reactivity  of  heavy  water  lattices  over  a  broad  spectrum  of  physics 
design  parameters. 

5.4  Comparison  of  reaction  rates.  The  Cross-Section  Evaluation  Working- Group  of 
bnl  has  recommended  the  slightly  enriched  U-metal  fuelled  experiments  done  at  mit  as 
benchmarks  for  evaluating  the  nuclear  data  for  the  heavy  water  moderated  systems 
(csewg  1978).  We  have  studied  these  lattices  for  evaluating  our  model  (and  cross- 
sections  as  well). 

The  effective  multiplication  factors  were  seen  to  be  very  close  to  unity,  the  maximum 
deviation  being  0-57  %.  But  the  discrepancies  noticed  in  the  comparison  of  calculated 
and  measured  reaction  rate  ratios  were  outside  the  acceptable  range  of  accuracy.  Even 
more  accurate  simulation  of  the  experimental  set-up  by  two-dimensional  finite 
difference  diffusion  theory  and  by  finite  element  method  did  not  lead  to  any  marked 
improvement  in  these  comparisons.  It  could  be  added  here  that  similar  observations 
have  been  made  by  other  authors  also  for  these  benchmarks. 

5.5  phwr  cluster  experiments.  A  number  of  experiments  have  been  carried  out  in 
Canada  with  7-,  19-,  and  28-rod  clusters  with  different  coolants.  All  the  clusters  have 
natural  U02  fuel  pellets  with  zircaloy-2  cladding  and  the  lattices  are  moderated  by 
heavy  water  (Green  1962, 1964a,b;  Serdula  1966;  De  Lange  1966).  These  were  analysed 
by  two  different  code  systems  described  below: 

Climax — It  is  an  extension  of  murli  to  treat  the  phwr  cluster  lattices.  A  ringwise 
homogenisation  of  the  cluster  is  adopted  for  spatial  flux  distribution  calculation. 
However,  the  heterogeneity  effects  in  fast  and  resonance  energy  ranges  are  obtained 
through  collision  probability  treatment  of  explicit  cluster  geometry. 

Club — The  physical  model  of  club  (Krishnani  1982)  is  based  on  a  combination  of  Ptj  and 
J ±  methods.  The  interaction  between  various  zones  within  a  ring  and  their 
contribution  to  outgoing  currents  at  ring  surfaces  are  calculated  by  method,  while 
each  region  is  connected  to  the  neighbouring  regions  by  the  J  ±  approach.  The  angular 
flux  at  each  region  interface  is  expanded  in  terms  of  spherical  harmonics  to  correct  for 
deficiencies  of  the  cosine  current  approximation. 

The  general  observation  emerging  from  computed  results  is  that  the  calculations  of 
climax  and  club  agree  consistently  with  the  corresponding  experimental  values  and 
also  amongst  themselves. 


6.  Conclusions 

The  multigroup  integral  transport  theory  provides  extremely  fast  and  accurate 
methods  for  the  analysis  of  thermal  reactor  lattices.  We  have  developed  a  number  of 
computer  codes  based  on  these  methods.  These  are  being  used  for  the  design,  analysis 
and  operational  problems  of  our  light  water  and  heavy  water  moderated  power 
reactors. 
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Special  features  of  the  safety  and  control  systems  of  the  Dhruva 
reactor 


A  K  RAY  and  S  N  SESHADRI 

Reactor  Control  Division,  Bhabha  Atomic  Research  Centre,  Bombay  400  085,  India 

Abstract.  The  prime  requirement  of  reactor  safety  combined  with  the  need  for  high 
availability  of  nuclear  plants  have,  in  recent  years,  led  to  considerable  research  and 
development  efforts  at  the  Bhabha  Atomic  Research  Centre  in  the  field  of  reactor  safety  and 
control  engineering.  The  areas  of  special  interest  have  been  the  development  of  a  fast  acting 
emergency  shutdown  system,  on-line  fault  detection  facility  for  the  reactor  protection  circuits, 
enhanced  instrumentation  capability  for  measurement  of  critical  plant  parameters  and 
computerised  systems  for  plant  protection,  control,  performance  evaluation,  disturbance 
analysis,  and  data  acquisition  and  display  with  particular  attention  to  the  problem  of  man- 
machine  interface.  Some  of  these  recent  concepts  have  been  incorporated  in  safety  and  control 
systems  of  the  Dhruva  reactor  which  is  at  present  undergoing  commissioning  trials  at 
Trombay.  The  special  features  of  these  systems  are  highlighted  in  the  paper.  The  safety  strategy 
adopted  for  the  reactor  and  the  consequent  development  of  special  safety  systems  are 
described  in  detail.  The  choice  of  the  reactor  control  scheme  and  the  methodology  followed  in 
the  design  of  the  automatic  power  control  system  are  indicated.  Campbell  instrumentation  for 
measurement  of  neutron  flux  or  in  other  words  reactor  power,  extensive  use  of  micropro¬ 
cessors  in  safety  related  instrumentation  and  an  improved  man-machine  interface  through 
suitable  design  of  control  room  have  helped  in  achieving  a  high  degree  of  reactor  safety.  The 
salient  features  of  these  systems  are  also  included. 

Keywords.  Reactor  safety;  reactor  control;  control  and  instrumentation. 
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1.  Introduction 

Dhruva  is  the  latest  addition  to  the  family  of  nuclear  research  reactors  in  Trombay. 
Designed,  constructed  and  presently  being  commissioned  by  engineers  and  scientists  of 
the  Bhabha  Atomic  Research  Centre  (barc),  the  reactor  will  provide  a  thermal  neutron 
flux  of  180  trillion  neutrons  cm2/sec  at  the  rated  power  of  100  Mw(th).  This  high 
neutron  flux  makes  the  reactor  a  versatile  nuclear  facility  for  fundamental  as  well  as 
engineering  research,  for  conducting  prototype  tests  on  power  reactor  fuels  and 
materials,  and  also  for  production  of  a  variety  of  radioisotopes  to  be  used  in  industry, 
medicine  and  agriculture. 

Certain  special  features  have  been  incorporated  in  the  design  of  safety,  control  and 
instrumentation  systems  of  Dhruva.  These  features,  hitherto  untried  in  other  reactors  of 
India,  help  in  achieving  a  high  degree  of  reactor  safety  and  plant  availability.  The 
evolution  of  these  new  systems  and  their  design  features  are  presented. 

This  article  has  been  specially  prepared  for  commemorating  the  sixtieth  birthday  of 
Dr.  Raja  Ramanna,  who  has  been  a  great  source  of  inspiration  and  guidance  to  the 
Dhruva  Project  and  through  this  we  pay  our  tributes  to  this  great  nuclear  scientist. 
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2.  Brief  description  of  the  reactor 

Dhruva  is  a  natural  uranium  fuelled,  heavy  water  moderated  and  heavy  water  cooled 
reactor.  Fuel  elements  in  the  form  of  multipin  clusters  of  uranium  metal  clad  in 
aluminium  are  located  in  specially  designed  coolant  channels  made  of  zircaloy.  The 
stainless  steel  reactor  vessel  houses  the  coolant  channels  and  is  placed  in  a  concrete 
vault  filled  with  demineralised  light  water.  The  vault  acts  as  a  shield  against  nuclear 
radiation  emanating  from  the  reactor  core.  The  hot  heavy  water  carrying  the  heat 
generated  in  the  fuel  rods  is  cooled  by  demineralised  light  water  in  a  set  of  three  heat 
exchangers.  The  light  water  in  turn  transfers  its  energy  through  another  set  of  five  heat 
exchangers  to  sea  water,  the  sea  being  the  ultimate  heat  sink.  In  order  to  maintain  purity 
of  heavy  water  the  free  space  in  the  reactor  vessel  is  filled  with  helium.  Reactor  power  is 
controlled  through  suitable  adjustment  of  moderator  level.  Provision  has  been  made 
for  on-power  fuelling  of  the  core  through  a  specially  developed  fuelling  machine.  A 
simplified  process  flow  diagram  is  shown  in  figure  1  and  table  1  indicates  the  salient 
features  of  the  reactor. 


3.  Safety  strategy 

The  safety  strategy  adopted  for  Dhruva  has  evolved  out  of  the  general  principles  of 
reactor  safety  normally  followed  in  the  design  of  nuclear  power  plants. 


Figure  1.  Simplified  process  flow  diagram  of  Dhruva. 
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Table  1.  Salient  features  of  Dhruva 


Item 

Description 

Reactor  type 

Thermal  research  reactor 

Reactor  power 

100  mw  (thermal) 

Thermal  neutron  flux 

1.8  x  1014  neutrons/cm2-sec 

Average  specific  power 

15.2  kW/kg  of  uranium 

Fuel  assembly 

7  pin  clusters  of  natural  uranium  metal  clad 
in  aluminium 

No.  of  fuel  assemblies 

129 

Moderator  and  reflector 

Heavy  water 

Primary  coolant 

Heavy  water 

Secondary  coolant 

Light  water 

Ultimate  heat  sink 

Sea  water 

Shielding 

Light  water  followed  by  high  density 
concrete 

Reactor  control  method 

Adjustment  of  moderator  level  in  reactor 
vessel 

Primary  shutdown  system 

Gravity  drop  of  cadmium  shutoff  rods  in 
the  core 

Emergency  shutdown  system 

Injection  of  gadolinium  solution  in  in-core 
tubes 

Additional  shutdown  system ' 

Dumping  of  moderator 

Refuelling 

On-power 

Reactor  block  dimension 

11.05  metres  in  diameter 

11.94  metres  in  height 

Reactor  vessel  dimension 

3.76  metres  in  diameter 

4.19  metres  in  height 

3.1  General  principle 

The  primary  goal  of  reactor  safety  is  to  prevent  any  significant  release  of  radioactive 
material  from  a  nuclear  facility  to  the  general  public  outside  the  plant  (Thompson  and 
Beckerley  1964).  In  order  to  achieve  this  goal  a  defence-in-depth  approach  is  normally 
adopted  for  the  design  of  a  nuclear  reactor.  This  design  approach  involves  two  parallel 
principles.  It  is  first  assumed  that  a  serious  accident  can  take  place  and,  therefore,  means 
must  be  found  to  prevent  the  release  of  fission  products  out  of  the  plant.  This  leads  to 
the  construction  of  multiple  barriers  against  radioactive  release  like  fuel  sheath, 
primary  heat  transport  system,  containment  etc.  The  second  principle  demands  that 
accidents  must  not  take  place.  This  results  in  specially  developed  safety  systems  totally 
dedicated  to  accident  prevention.  These  ensure  that  the  reactor  is  automatically 
shutdown  whenever  minor  plant  upsets  tend  to  become  major.  These  also  provide  long¬ 
term  cooling  to  the  fuel  elements  and  contain  potential  release  of  radioactivity. 

The  above  features  namely,  provision  of  safety  systems  which  prevent  accidents  in 
the  first  place,  and  multiple  barriers  to  radioactive  release  which  ensure  public  safety 
even  in  the  event  of  a  serious  accident,  have  been  incorporated  in  all  reactors  of  India 
and  they  are  largely  responsible  for  the  excellent  record  of  reactor  safety  maintained  in 
this  country.  However,  constant  effort  for  continuous  improvement  in  the  state  of  the 
art  is  an  interesting  feature  of  nuclear  technology.  The  search  for  further  improvement 
in  reactor  safety  leads  to  the  conclusion  that  defence-in-depth  approach  should  be 
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further  extended  to  the  design  of  the  overall  safety  system  itself.  This  has  been  tried  in 
the  Dhruva  reactor. 

3.2  Defence-in-depth  in  safety  system  design 

Defence-in-depth  design  approach  implies  a  need  to  ensure  that  accidents  will  be 
prevented  even  if  any  individual  safety  system  fails  to  perform  its  safety  functions.  In 
order  to  meet  this  requirement  it  will  be  useful  to  have  a  reactor  concept  which  by  virtue 
of  its  intrinsic  characteristics  prevents  accidents  or  minimises  their  consequence.  It  is 
also  necessary  to  have  sufficient  redundancy  in  the  safety  systems  so  that  failure  of  any 
individual  system  does  not  diminish  the  protection  capability  of  the  overall  system. 
Since  recent  experience  shows  that  identical  systems  can  fail  simultaneously  due  to 
common-mode  faults  (Moore  and  Hanauer  1972),  redundant  safety  systems  should  be 
based  on  diverse  principles  of  operations.  Defence-in-depth  philosophy  also  demands 
that  individual  safety  systems  must  independently  perform  their  safety  functions 
whenever  needed.  This  requirement  of  high  availability  can  be  largely  met  by  employing 
fail-safe  design  philosophy.  However,  it  is  to  be  recognised  that  an  absolute  fail-safe 
feature  is  nearly  impossible  to  achieve  in  practice.  Preparedness  for  unsafe  failures  is, 
therefore,  an  important  aspect  of  defence-in-depth  safety  strategy.  An  on-line  fault¬ 
checking  facility  helps  in  immediate  detection  of  unsafe  failures  and  this  feature,  if 
incorporated,  will  improve  the  protection  capability  of  safety  systems. 

The  safety  strategy  adopted  for  Dhruva  thus  departs  from  the  traditional  approach  in 
its  special  emphasis  on:  (i)  intrinsic  safety  features  of  the  plant,  (ii)  diversely  functioning 
safety  systems  and  (iii)  continuous  on-line  fault-detection  facility. 


4.  Special  safety  features 

The  primary  cause  of  practically  all  reactor  accidents  is  a  mismatch  between  the  energy 
liberation  rate  within  the  core  and  the  energy  removal  rate  of  the  coolant.  Accidents  can 
be  prevented  to  a  great  extent  if  reactor  power  is  always  restricted  within  the  heat 
transport  capability  of  the  cooling  systems.  The  disturbances  which  tend  to  upset  the 
power-coolant  balance  are  many  and  can  be  classified  under  two  broad  categories. 
These  are:  (i)  externally  or  internally  induced  reactivity  transients  (ii)  failures  of  the  heat 
transport  systems.  Since  the  possibility  of  a  major  power-coolant  mismatch  cannot  be 
ruled  out,  it  is  necessary  to  ensure  a  positive  reactor  shutdown  during  such  an 
occurrence. 

For  Dhruva  the  inherent  characteristics  of  the  plant  as  well  as  the  special  features 
incorporated  in  the  reactor  shutdown  systems  help  in  achieving  this  goal. 

4.1  Intrinsic  characteristics 

A  sudden  loss  of  coolant  from  the  reactor  core  is  the  most  severe  of  all  credible  causes  of 
a  major  power-coolant  imbalance.  Although  a  failure  of  this  kind  is  highly  improbable 
in  a  nuclear  plant,  public  safety  needs  to  be  ensured  even  for  such  eventualities.  Loss  of 
coolant  accident  (loca)  poses  a  serious  safety  problem  due  to  its  adverse  impact  on 
both  reactor  power  and  the  heat  transport  system.  This  situation  not  only  leads  to  a 
total  loss  of  heat  transport  capability  but  also  tends  to  increase  reactor  power  due  to  the 
resultant  positive  reactivity  effect.  It  has  been  estimated  in  the  case  of  Dhruva  that  the 
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gain  in  reactivity  on  a  total  coolant  loss  can  be  as  high  as  8.7  milliK.  Undesirable 
though  it  is  from  the  safety  standpoint,  positive  reactivity  feedback  from  a  loca  is  an 
inherent  nuclear  characteristic  of  all  pressure  tube  type  of  reactors.  This  nuclear 
limitation  has  been  overcome  in  the  Dhruva  reactor  by  an  ingenious  engineering  design 
of  the  primary  coolant  system. 

The  primary  coolant  circuit  of  Dhruva  is  divided  into  three  independent  loops,  each 
consisting  of  an  operating  pump,  a  heat  exchanger,  valves  and  interconnecting  piping. 
Figure  2  shows  a  simplified  flow  diagram  of  a  single  loop.  It  may  be  seen  that  cool  heavy 
water  from  the  inlet  plenum  flows  upwards  through  the  coolant  channels  containing 
the  reactor  fuel  and  then  joins  the  outlet  header.  Each  of  the  coolant  loops  draws  hot 
heavy  water  from  the  header  and  after  cooling  in  the  respective  heat  exchangers 
discharges  the  same  to  the  inlet  plenum,  thus  completing  the  cycle.  A  part  of  the  cool 
heavy  water  at  heat  exchanger  outlet  is  diverted  to  the  top  of  reactor  vessel  for  cooling 
the  top  tube  sheets  and  other  internal  structural  assemblies.  This  heavy  water  after 
cooling  the  internal  components  of  the  reactor  vessel  falls  on  the  free  surface  of  the 
moderator.  Thus  at  every  instant  heavy  water  from  the  closed  main  coolant  loop  enters 
into  the  free  moderator  region  of  the  vessel.  In  order  to  maintain  dynamic  equilibrium, 
it  is  necessary  to  return  an  equal  quantity  of  heavy  water  from  the  moderator  to  the 
coolant  system.  This  has  been  achieved  by  connecting  the  reactor  vessel  to  the  suction 
side  of  the  coolant  pump  in  each  loop  by  suitably  sized  lines.  In  this  arrangement  the 
reactor  vessel  acts  as  a  surge  tank  for  the  coolant  and  consequently  moderator  becomes 
the  source  of  coolant  make-up  during  a  loca.  Thus  a  break  in  the  coolant  circuit 
automatically  increases  the  moderator  return  flow  and  loss  of  coolant  heavy  water  is 
continuously  replenished  by  moderator  heavy  water.  It  is  clear  that  fuel  cooling  will 
continue  so  long  as  moderator  inventory  in  the  reactor  vessel  is  not  completely  lost.  The 
integrated  coolant-moderator  circuit  of  Dhruva  ensures  that  a  loss  of  coolant  is 
necessarily  preceded  by  a  total  loss  of  moderator  which  results  in  an  automatic  reactor 
shutdown.  Also,  a  rupture  in  the  coolant  circuit  does  not  immediately  affect  the  heat 
transport  capability  of  the  cooling  system  and  this  provides  sufficient  time  for  actuation 


Figure  2.  Simplified  primary  coolant  flow  diagram  (single  loop). 
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of  the  emergency  core  cooling  system.  These  two  factors  enable  preservation  of  fuel 
integrity  even  after  a  double  ended  rupture  of  the  largest  coolant  pipe,  loca  in  Dhruva , 
therefore,  does  not  pose  any  problem  of  fission  product  release  even  within  the  plant. 

4.2  Emergency  shutdown  system 

?  t 

The  provision  of  two  diversely  functioning  high  speed  shutdown  systems,  either  of 
which  is  capable  of  causing  a  rapid  reactor  shutdown  for  the  entire  spectrum  of  credible 
accidents,  is  a  special  feature  of  Dhruva.  The  primary  shutdown  system  utilises  the 
traditional  method  of  inserting  a  set  of  neutron  absorber  elements,  called  shutoff  rods, 
into  the  reactor  core.  There  are  nine  shutoff  rods  in  Dhruva  which  use  cadmium  as  the 
neutron  absorbing  material.  The  drive  mechanisms  of  these  rods  are  of  electro¬ 
mechanical  type  and  follow  the  popular  clutch  and  drum  arrangement.  In  this  design, 
the  rod  is  attached  to  a  steel  wire  rope  which  in  turn  is  wound  on  a  drum.  An 
electromagnetic  clutch,  if  energised,  connects  the  drum  to  a  motor  through  an 
irreversible  gear  train.  When  the  reactor  is  in  operation,  the  rod  is  parked  above  the  core 
region  and  is  held  in  position  by  the  energised  cluth.  On  a  reactor  shutdown  signal,  the 
clutch  is  deenergised  and  the  rod  drops  freely  into  the  core.  Although  the  shutoff  rod 
assemblies  are  independent  of  one  another,  they  are  identical  in  design  and 
construction.  Thus,  in  spite  of  built-in  redundancy  the  primary  shutdown  system  is 
vulnerable  to  common-mode  failures.  To  guard  against  such  failures  a  diversely 
functioning  emergency  shutdown  system  has  been  provided.  This  system,  specially 
developed  for  Dhruva ,  injects  a  neutron  absorbing  liquid  solution  (poison)  into  a  set  of 
empty  tubes  located  in  the  reactor  core. 

4.2a  System  description  The  system,  shown  in  figure  3  comprises  essentially  of: 


Figure  3.  Emergency  shutdown  system. 
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(i)  twenty  tubes  passing  through  the  reactor,  termed  as  poison  tubes,  and  the 
connecting  headers  at  top  and  bottom  ends;  (ii)  a  50  litre  capacity  liquid  poison  tank 
containing  solution  of  gadolinium  nitrate  in  demineralised  water,  (iii)  a  60  litre 
capacity  nitrogen  tank;  (iv)  a  20  litre  capacity  buffer  tank;  (v)  a  400  litre  capacity 
poison  storage  tank;  and  (vi)  nitrogen  cylinders,  a  pump,  connecting  pipes  and  valves. 

The  poison  tank  containing  liquid  neutron  absorber  is  connected  to  the  lower  header 
and  also  to  the  nitrogen  tank  through  three  fast-acting  solenoid  valves  (scram  valves) 
operating  in  parallel.  The  nitrogen  tank  is  maintained  at  a  pressure  of  1 5  kg/cm2  (g)  by 
the  nitrogen  cylinders.  The  upper  header  is  connected  to  the  buffer  tank  which  in  turn  is 
connected  to  the  poison  tank  through  two  fast  acting  solenoid  valves  (equaliser  valves) 
operating  in  parallel.  Two  more  fast  acting  solenoid  valves  (vent  valves)  operating  in 
series,  connect  the  poison  tank  to  an  air  plenum  which  is  always  maintained  at  near 
atmospheric  pressure.  These  valves  play  an  important  role  during  the  poison  injection 
process.  Other  valves  in  the  system  facilitate  post-injection  operations  like  dumping, 
washing  and  refilling  of  poison,  and  they  remain  in  their  deenergised  fail-safe  states 
prior  to  and  during  the  injection  process. 

4.2 b  Principle  of  operation  When  the  reactor  is  operating,  the  liquid  level  in  the 
poison  tubes  is  maintained  below  the  reactor  vessel  by  applying  suitable  nitrogen 
pressure  to  the  buffer  tank  and  venting  the  poison  tank  to  the  air  plenum.  In  this 
condition  the  scram  and  equaliser  valves  are  in  energised  close  states  and  the  vent  valves 
are  in  energised  open  states.  On  an  emergency  shutdown  signal  these  valves  are 
deenergised,  thus  pressurising  the  liquid  in  poison  tank.  This  initiates  rapid  injection  of 
poison  solution  into  the  poison  tubes.  The  time  required  to  fill  the  tubes  is 
approximately  0-8  second.  The  high  velocity  liquid  front  is  damped  by  the  compressed 
nitrogen  in  the  buffer  tank  which  creates  a  counter  pressure  on  the  moving  mass.  The 
motion  is  found  to  be  oscillatory.  However,  the  oscillation  is  contained  well  above  the 
active  core  and  as  such  there  is  no  variation  in  reactivity. 

In  order  to  ensure  a  high  degree  of  stability  of  the  poison  level  in  the  reactor 
operating  mode,  a  novel  feature  utilising  the  principle  of  pressure  feedback  has  been 
incorporated  in  the  system.  It  may  be  seen  from  figure  3  that  the  upper  header  pressure 
is  fed  back  to  the  so-called  ‘bubble  tube’  through  a  feedback  line.  The  junction  of  these 
lines,  marked  as  point  ‘x’  in  the  figure,  is  kept  at  the  same  elevation  as  the  desired  level  in 
the  poison  tubes.  The  normal  levels  in  the  poison  tank  and  the  poison  tubes  have  an 
elevation  difference  of  54  meters.  Thus  a  differential  pressure  equivalent  to  5.4  meters 
of  solution  head  is  to  be  maintained  between  the  tubes  and  the  tank.  The  upper  header 
is,  however,  kept  at  a  pressure  somewhat  higher  than  the  above  requirement,  so  as  to 
maintain  a  steady  flow  of  gas  through  the  pressure  feedback  line  and  the  bubble  tube.  A 
rise  in  the  poison  tube  level  reduces  the  gas  flow  rate,  thereby  increasing  the  pressure  in 
the  poison  tubes.  This  immediately  pushes  down  the  liquid  to  the  original  elevation. 
Similarly,  a  fall  in  the  level  increases  the  gas  flow  rate.  This  reduces  the  pressure  in  the 
poison  tubes  which  again  restores  the  original  level. 

4.2c  Post-injection  operations  During  the  poison  injection  process  the  nitrogen 
supply  system  is  isolated  from  the  nitrogen  tank.  After  the  injection  is  completed,  vent 
valves  are  opened.  This  releases  the  pressure  built  up  in  the  system.  For  a  restart  of  the 
reactor,  the  poison  from  the  core  is  brought  back  to  the  storage  tank.  The  poison  tubes 
are  then  flushed  with  demineralised  water  to  remove  any  residual  poison  that  may  stick 
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to  the  tube  walls.  After  the  completion  of  flushing,  the  system  is  once  again  filled  with 
poison. 

4.2 d  Performance  under  failure  The  dynamics  of  the  poison  injection  process  was 
analysed  through  a  detailed  mathematical  model  which  was  verified  by  actual  tests  on  a 
scaled  prototype  of  the  system.  The  process  simulation  was  carried  out  on  the  BESM-6 
computer  of  barc.  Using  the  model,  the  behaviour  of  the  system  under  normal 
condition  as  well  as  under  various  modes  of  failure  were  determined.  This  study  reveals 
that  poison  can  be  injected  in  less  than  one  second  even  if  two  of  the  three  scram  valves 
and  three  of  the  four  equaliser  and  vent  valves  simultaneously  fail  to  operate. 
Therefore,  satisfactory  operation  of  one  scram  valve  and  any  of  the  vent  and  equaliser 
valves  ensure  adequate  system  performance.  Since  these  valves  operate  on  fail-safe 
principle,  the  probability  of  not  meeting  the  above  requirement  is  extremely  small  and 
the  same  is  estimated  to  be  less  than  1  in  106.  Even  in  the  case  of  three  scram  valves 
failing  simultaneously,  poison  is  still  injected  though  at  a  considerably  slower  rate.  The 
injection  time  under  such  failure  is  approximately  8  seconds.  Figure  4  shows  the  post¬ 
injection  rise  in  poison  level  as  a  function  of  time  for  the  normal  mode  as  well  as  the 
extreme  condition  of  all  scram  valves  failing. 

4.3  On-line  fault  detection  facility 

The  failures  in  a  safety  system  are  normally  classified  under  ‘safe’  and  ‘unsafe’ 
categories.  A  safe  failure  inadvertently  actuates  a  safety  system  thereby  causing  a 


Figure  4.  Poison  level  transients. 
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spurious  reactor  shutdown.  Although  such  failures  reduce  plant  availability  they  have 
no  adverse  effect  on  safety.  An  unsafe  failure  decapacitates  a  safety  system  and, 
therefore,  is  of  great  concern.  Periodic  functional  testing  of  safety  systems  helps  in 
reducing  the  probability  of  unsafe  failures.  The  fractional  dead  time  (D)  of  a  safety 
system  which  represents  the  fraction  of  operating  time  that  the  system  may  remain 
decapacitated  is  given  by 

D  =  \fj 

where  fu  is  the  unsafe  failure  rate  and  T  is  the  interval  between  two  successive  tests.  It  is 
clear  that  greater  the  frequency  of  tests  lesser  is  the  probability  of  an  unsafe  failure. 
Manual  testing  of  systems  involves  considerable  time  and  effort  on  the  part  of  the 
operators  and  makes  it  difficult  to  achieve  rapid  rates  of  testing.  High  speed  automatic 
test  facilities  are,  therefore,  required  to  overcome  the  problem  of  unsafe  failures.  This 
has  been  attempted  in  some  of  the  critical  areas  of  the  Dhruva  safety  system. 

The  fine  impulse  test  (fit)  facilities  of  Dhruva  continuously  search  for  unsafe  failures 
in  the  safety  logic  systems  which  generate  safety  signals  and  actuate  safety  mechanisms. 
Each  safety  function  is  generated  in  three  independent  logic  channels  and  the  relevant 
mechanism  is  actuated  whenever  two  or  more  of  these  channels  demand  a  safety  action. 
The  fit  system  is  based  on  a  8-bit  Intel  microprocessor.  It  may  be  seen  from  figure  5 
which  represents  the  system  block  diagram,  that  fine  impulses  are  injected  into  each 
logic  channel  as  simulated  input  signals.  These  pulses  represent  the  conditions  which 
will  prevail  during  an  emergency.  Sufficiently  short  pulses  used  for  this  purpose  ensure 
that  the  output  devices  do  not  respond  to  these  stimuli.  The  outputs  of  the  logic 
channels  for  the  simulated  inputs  are  read  by  the  central  processing  unit  (cpu)  through 
digital  input  data  cards  and  get  checked  for  any  possible  fault.  When  faults  are 
encountered,  the  system  analyses  the  faults  to  the  card  level  and  displays  the  address  of 


Figure  5.  Fine  impulse  test  facility. 
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the  defective  cards  on  the  display  panel.  The  system  takes  less  than  one  second  to  check 
all  components  under  its  purview.  A  watch  dog  timer  and  other  self-checking  facilities 
have  been  incorporated  in  the  system  to  improve  its  reliability. 

The  fit  facilities  of  Dhruva  can  carry  out  checks  a  million  time  faster  than  the  manual 
testing  rate  normally  followed  in  operating  plants.  Since  the  probability  of  a  failure  is 
inversely  proportional  to  the  frequency  of  testing,  these  facilities  provide  an  extremely 
effective  protection  against  unsafe  failure. 


5.  Control  and  instrumentation  systems 

The  control  and  instrumentation  (C  &  I)  systems  of  Dhruva  are  designed  to  achieve  a 
high  degree  of  automatation  in  plant  operation.  This  reduces  the  routine  work  load  of 
the  operating  staff,  providing  increased  opportunity  to  concentrate  on  unusual 
occurrences.  The  C  &  I  systems  also  assist  the  operator  in  high  level  monitoring  of  plant 
performance,  thereby  enhancing  both  safety  and  availability  of  the  reactor. 

5.1  Automatic  power  control  system 

Reactor  power  in  Dhruva  is  regulated  through  suitable  variation  of  moderator  level  in 
the  reactor  vessel.  As  shown  in  figure  6,  the  moderator  level  is  changed  by  varying  the 
rate  of  flow  of  heavy  water  into  and  out  of  the  vessel.  The  speed  of  the  moderator  pumps 
in  the  inflow  line  and  the  opening  of  control  valves  in  the  outflow  line  are  controlled  to 
achieve  the  required  flow  variation.  This  method  of  reactor  control  offers  certain  safety 
advantages.  It  enables  rapid  draining  of  moderator  from  the  core  by  opening  of  the 
control  valves  and  tripping  of  the  moderator  pumps,  thereby  causing  a  reactor 
shutdown.  The  control  system  can,  therefore,  be  used  as  an  additional  shutdown  system 
of  the  reactor. 

The  dual  control  of  both  inflow  and  outflow  rates  of  heavy  water  is  a  special  feature 
of  Dhruva.  This  scheme  results  in  low  equilibrium  flow  in  the  core  which  restricts  the 


Figure  6.  Automatic  power  control  system. 
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flow-induced  vibrations  of  in-core  components.  This  also  reduces  the  power  consump¬ 
tion  by  the  pumps.  Added  to  these  the  large  reactivity  range  of  this  scheme  enhances  the 
control  capability  of  the  system. 

To  achieve  high  availability  and  adequate  safety,  triplicated  control  channels  are 
provided.  Each  channel  is  essentially  independent  of  the  other  two  and  controls  one 
valve  and  one  pump.  The  system  is  so  designed  that  control  capability  is  preserved  even 
on  failure  of  one  complete  channel.  However,  the  possibility  of  such  failures  is  low  since 
the  functioning  of  the  various  sub-systems  is  continuously  monitored  by  a  number  of 
signal  comparator  units  incorporated  for  this  purpose.  The  control  signal  for  the  pump 
or  the  valve  is  derived  through  suitable  processing  of  the  power  demand  signal  set  by 
the  operator,  and  the  feedback  signals  provided  by  the  neutron  power  detecting 
instruments. 

The  system  design  was  optimised  through  a  parametric  study  of  the  system 
behaviour  under  various  steady  state  and  transient  conditions.  Detailed  mathematical 
models  were  developed  for  this  purpose.  The  reactor  was  described  by  the  point  kinetics 
equations  with  six  groups  of  delayed  neutrons.  Accelerations  in  fluid  flow  were 
calculated  through  incompressible  one-dimensional  fluid  flow  equations.  Individual 
models  were  also  developed  for  the  electronic  units,  moderator  pumps  and  control 
valves.  The  integrated  model  was  found  to  be  highly  nonlinear  with  29  state  variables. 
The  steady  state  stability  of  the  system  which  is  valid  for  small  perturbations,  was 
evaluated  through  the  linearised  version  of  the  model.  It  was  established  that  the 
control  system  was  stable  with  adequate  gain  and  phase  margins.  The  transient 
performance  was,  however,  analysed  with  the  help  of  a  detailed  computer  code  which 
included  all  the  non-linearities  of  the  model. 


5.2  Wide  range  monitoring  of  reactor  power 

Neutron  flux  which  is  proportional  to  reactor  power,  is  normally  measured  by 
processing  a  neutron  detector  output  in  either  pulse  mode  or  d.c.  mode.  In  pulse  mode 
individual  pulses  from  the  detector  are  counted  and  the  count  rate  represents  neutron 
flux  or  reactor  power.  This  technique  is  employed  for  low  level  flux  upto  a  maximum  of 
106  nv.  At  high  flux  values  individual  pulses  overlap  and  the  resultant  current  is 
measured  in  the  d.c.  mode.  The  d.c.  current  is  proportional  to  neutron  flux  and  hence, 
reactor  power.  In  recent  years,  a  third  technique  for  the  measurement  of  neutron  flux 
has  been  developed.  This  is  based  on  the  noise  theories  of  Campbell.  Applying  this 
theory  it  can  be  shown  that  the  mean  square  of  the  current  from  a  source  of  random 
current  pulses  is  proportional  to  the  pulse  rate  and  square  of  the  pulse  height,  provided 
that  the  d.c.  portion  of  the  current  is  cut  off  by  capacitive  coupling  and  the  fluctuating 
a.c.  component  is  used  exclusively.  Therefore,  the  mean  square  of  a.c.  components  of 
the  detector  current  is  proportional  to  neutron  flux.  The  Campbell  technique  provides 
certain  advantages.  Since  only  the  variations  in  current  are  measured,  d.c.  leakage 
currents  of  cables  do  not  result  in  significant  error.  Also,  the  mean  square  output  being 
proportional  to  the  square  of  pulse  height  smaller  pulses  contribute  less  to  the  total 
signal,  thereby  improving  the  gamma  discrimination  capability  of  the  system.  A  proper 
integration  of  pulse  and  Campbell  techniques  enables  wide  range  monitoring  of  reactor 
power  through  a  single  neutron  detector.  Such  a  system  has  been  developed  for  Dhruva. 
This  system,  shown  in  figure  7,  monitors  reactor  power  over  a  range  of  10  decades  using 
a  single  fission  chamber.  It  has  been,  thus,  possible  to  use  a  few  detectors  and  yet  have 
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Figure  7.  Integrated  Campbell-pulse  channel 


wide  range  monitoring  capability.  A  ten  decade  coverage  of  reactor  power  by  the 
regular  instrumentation  system  is  a  special  feature  of  Dhruva  and  this  enables  normal 
reactor  start-up  even  after  a  very  long  shutdown  without  the  help  of  additional 
instrumentation. 

5.3  Instrumentation  for  operator  support 

A  number  of  microprocessor-based  systems  assist  the  operator  in  high  level  monitoring 
of  plant  performance.  These  enable  early  detection  of  minor  plant  upsets  and  guide  the 
operator  in  taking  corrective  actions.  The  data  acquisition  and  display  system,  coolant 
channel  surveillance  systems  and  alarm  annunciator  system  are  provided  for  this 
purpose.  These  systems  were  specially  developed  to  meet  the  high  reliability 
requirements  of  reactor  instrumentation. 

The  data  acquisition  and  display  system  is  configured  around  a  16-bit  Intel 
microprocessor  and  employs  two-colour  cathode  ray  tubes  (crts)  for  data  display.  The 
system  exploits  the  advantages  offered  by  colour  CRT  terminals  and  presents  plant  data 
in  easy-to-grasp  formats  like  mimics,  graphs,  tables,  bar  charts  etc.  This  leads  to 
enhancement  of  opera  tor’s  comprehension  of  plant  beha  viour  and  therefore  to  a  better 
man-machine  interaction.  The  functions  carried  out  by  the  system  include  alarm 
annuciation  with  detailed  instructions  for  operator  action,  display  of  plant  status  in 
mimic  and  tabular  forms,  system  performance  calculations  and  graphic  presentation  of 
the  trend  in  various  process  variables  during  routine  transients  like  start-up,  manual 
shutdown,  raising  or  lowering  of  power  etc.  Facilities  are  also  available  for  routine  or 
demand  logs  of  plant  parameters. 

The  coolant  channel  surviellance  systems  monitor  the  health  of  fuel  elements  in  the 
reactor  core.  The  coolant  flow,  outlet  temperature  and  radioactivity  level  in  each 
coolant  channel  are  continuously  monitored  by  three  independent  microprocessor- 
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based  systems  and  the  relevant  information  is  displayed  in  the  control  room.  Whenever 
any  of  these  parameters  exceeds  the  operational  limit  the  system  provides  an  alarm  and 
indicates  the  address  of  the  affected  coolant  channel.  In  the  normal  mode  process  status 
is  displayed  on  a  demand  by  the  operator. 

The  alarm  annunciator  system  provides  audio-visual  warning  to  the  operator  when 
minor  perturbations  occur  in  operating  parameters.  In  order  to  achieve  high  reliability 
a  dual  computer  scheme  has  been  adopted  for  this  system.  In  this  scheme  each 
computer  is  entirely  capable  of  performing  all  functions  of  the  annunciator.  Only  one 
computer  system  is  normally  active  and  the  second  one  is  a  standby.  In  the  event  of  a 
malfunction  of  the  active  computer  system  the  control  is  automatically  transferred  by 
the  watchdog  to  the  standby  system. 

Extensive  self-checking  features  have  been  incorporated  in  all  computerised  systems 
of  Dhruva  and  they  are  responsible  for  the  high  reliability  of  these  systems.  Apart  from 
the  reduction  in  size  and  cost,  these  systems  offer  an  additional  advantage  of  flexibility 
which  facilitates  system  modification  during  commissioning. 

6.  Control  room 

The  control  room  provides  all  the  information  and  controls  required  for  safe  operation 
of  the  plant.  Considerable  efforts  were  made  to  evolve  an  optimum  design  of  the  control 
room,  especially  the  sizing  and  layout  of  panels  and  consoles  as  well  as  the  arrangement 
of  control  and  display  devices.  The  main  emphasis  was  on  achieving  an  optimum 
integration  of  the  operator  with  the  process  through  appropriate  human  factor 
engineering.  This  was  considered  to  be  essential,  since  many  of  the  abnormal 
occurrences  in  the  operating  plants  can  be  attributed  to  operator  errors  resulting  from 
deficiencies  in  control  room  design. 

In  view  of  the  multitude  of  tasks  to  be  performed  from  the  control  room,  it  is 
necessary  to  have  functional  grouping  of  different  tasks.  The  Dhruva  control  room 
consisting  of  nine  panels  and  one  console  having  a  total  length  of  approximately  30 
meters  can  be  subdivided  into  three  functional  areas.  Routine  reactor  start-up  and 
shutdown  is  carried  out  at  the  console.  Reactor  cooling  and  other  process  systems 
which  are  unaffected  by  reactor  shutdown,  are  controlled  from  the  central  five  panels. 
The  extreme  panels,  two  on  each  side,  provide  the  information  which  is  to  be 
periodically  logged  by  the  operator. 

In  order  to  facilitate  identification  of  different  controls,  operational  mimic  diagrams 
are  extensively  used.  These  diagrams  display  the  flow  of  a  process  medium  in  a  system 
and  include  the  major  components  like  pumps,  valves,  tanks,  etc,  which  are  represented 
by  symbols.  The  control  switches  of  an  equipment  are  incorporated  within  the 
corresponding  symbol.  These  switches  are  of  lighted  push  button  type  and  thus 
equipment  status  is  always  indicated  on  the  corresponding  symbol.  This  enables  the 
operator  to  immediately  determine  the  effect  of  a  control  action  taken  by  him.  As  a 
further  aid  to  the  operator  the  status  of  relevant  process  variables  are  displayed  through 
suitable  indicators.  Mimic  diagrams  are  structured  in  different  colours,  each  colour 
representing  a  specific  system.  This  helps  in  highlighting  different  systems. 

Two  colour  crts  located  on  the  console  display  detailed  information  on  the  plant 
status.  This  facility  enables  the  operator  to  gather  all  data  from  a  single  instrument. 
Individual  instruments  like  recorders  and  meters  also  provide  standby  indications. 
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7.  Conclusion 

The  safety  and  control  systems  of  Dhruva  were  entirely  designed,  developed  and 
fabricated  at  barc.  The  special  features  of  these  systems  reflect  the  current  trends 
towards  diversified  safety  systems,  increased  use  of  computer  power  in  C  &  I  functions, 
enhancement  of  existing  instrumentation  capability  and  improved  man-machine 
interaction  through  advanced  control  centres.  These  concepts  are  likely  to  be  employed 
in  future  nuclear  plants  of  our  country. 
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Physics  and  engineering  aspects  of  fast  reactor  safety 
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Abstract.  A  vast  amount  of  research  and  development  work  has  been  done  in  recent  years  to 
resolve  the  issues  of  relevance  to  the  safety  of  the  Liquid  Metal  cooled  Fast  Breeder  Reactor 
(lmfbr).  Based  on  the  results  of  this  research  as  well  as  on  the  experience  gained  from  the 
operation  of  test  and  prototype  lmfbr’s,  a  certain  consensus  is  emerging  on  the  safety 
requirements  of  a  modern  sodium-cooled  large  fast  power  reactor.  The  paper  reviews  the 
fundamental  physics  and  engineering  aspects  of  lmfbr  safety  with  reference  to  the  Fast 
Breeder  Test  Reactor  (fbtr)  now  being  commissioned  at  Kalpakkam,  and  the  proposed  larger 
Prototype  Fast  Breeder  Reactor  (pfbr).  The  elements  contributing  to  the  inherent  safety  of 
fast  reactors  are  recapitulated  followed  by  description  of  the  philosophy  of  the  plant 
protection  system  and  the  use  of  engineered  safeguards  to  enhance  the  safety.  Finally,  the 
principles  used  for  the  containment  of  radioactivity  are  discussed. 

Keywords.  Fast  breeder  reactors;  safety  requirements;  engineered  safeguards;  containment 
of  radioactivity. 

PACS  No.  28  50;  28  45 


1.  Introduction 

The  fundamental  concerns  of  nuclear  reactor  safety  are  apparent  in  the  basic  nuclear 
fission  chain  reaction  itself.  In  this  reaction,  a  neutron  splits  the  nucleus  of  an  atom  of  a 
fissile  material  like  uranium  or  plutonium  into  two  smaller  radioactive  nuclei  (fission 
products).  At  the  same  time,  a  considerable  amount  of  energy  in  the  form  of  kinetic 
energy  of  the  fission  product  nuclei  is  released,  along  with  nuclear  radiations,  and 
several  additional  neutrons.  The  released  fission  neutrons,  which  are  emitted  in  a 
spectrum  of  energies  peaking  around  2  MeV,  can  induce  fission  in  further  atoms 
causing  a  chain  reaction,  which  process  is  used  for  heat  production  in  a  nuclear  reactor. 
It  is  immediately  seen  that  the  problems  of  safety  specific  to  nuclear  reactors  would 
have  their  origin  in  the  following: 

a)  control  of  the  chain  reaction  and  reactor  geometry  to  prevent  unchecked  growth  of 
neutron  population  that  can  cause  heat  production  in  excess  of  the  heat  removal 
capability, 

b)  shielding  of  the  nuclear  radiations  produced  during  the  chain  reaction, 

c)  containment  of  the  radioactive  fission  products,  shielding  of  their  nuclear  radiations, 
and  provision  of  cooling  to  remove  their  radioactive  decay  energy  when  the  reactor  is 
shut  down. 

To  appreciate  the  development  of  concepts  in  nuclear  reactor  safety,  it  is  necessary  to 
recall  the  events  that  have  occurred  in  the  short  period  of  time  that  has  elapsed  since  the 
discovery  of  the  fission  reaction  in  1939.  The  possibility  of  using  low  energy  neutrons 
for  fission  of  U235  and  the  need  for  high  energy  neutrons  for  fission  of  U238  or  thorium 
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was  shown  in  the  same  year,  as  also  the  existence  of  fission  neutrons  and  delayed 
neutrons.  This  was  followed  by  Fermi  attempting  and  failing  to  sustain  a  chain  reaction 
in  a  light  water  moderated  natural  uranium  assembly.  The  brilliant  theorising  and 
experimentation  of  Fermi’s  group  led  to  the  important  decisions  to  use  graphite  as 
moderator  as  well  as  to  lump  the  fuel  to  minimise  the  loss  of  neutrons  by  non-fission 
reactions  in  U238.  This  led  to  the  successful  atomic  pile  cp  1  which  went  critical  on 
December  2,  1942.  The  reactor  used  6  tons  of  pure  uranium  metal,  50  tons  of 
compressed  uranium  dioxide  powder,  400  tons  of  graphite  and  a  number  of  control 
rods  consisting  of  cadmium  sheets  nailed  to  wood.  In  addition  to  shim  control  rods 
there  were  cocked  safety  rods,  which  could  be  released  to  fall  into  the  core 
automatically,  should  the  neutron  intensity  as  measured  by  a  BF3  gas  filled  ionization 
chamber  rise  too  high. 

After  this  historic  achievement,  several  other  reactors  were  constructed  in  the  usa 
driven  by  the  need  for  plutonium  for  weapons.  The  first  enriched  uranium,  light  water 
moderated  reactor  was  built  in  1944.  The  first  atomic  bomb  (Trinity  test)  was  exploded 
on  16th  July  1945  and  was  a  plutonium  device.  This  was  followed  21  days  later  by  the 
Hiroshima  U235  bomb  and  a  further  3  days  later  by  the  Nagasaki  plutonium  bomb. 

Canada  achieved  its  first  chain  reaction  in  the  heavy  water  moderated  reactor  zeep  in 
1945,  uk  achieved  it  in  the  graphite  moderated  pile  gleep  in  1947,  France  in  the  heavy 
water  moderated  reactor  zoe  in  1948  and  India  achieved  its  first  chain  reaction  in  the 
apsara  reactor  in  1956. 

The  first  controlled  fast  chain  reaction  was  in  1946  in  the  25  kWt  clementine  fast 
reactor  of  usa  using  plutonium  fuel  and  mercury  coolant.  The  first  nuclear  electricity 
production  was  also  from  a  fast  reactor,  the  enriched  uranium  fuelled,  NaK  cooled  1 
MWt  ebr  1  which  went  critical  in  1951.  India  achieved  its  fast  chain  reaction  in  the 
purnima  reactor  in  1972. 

In  general,  the  safety  of  a  reactor  is  ensured  by  parallel  approaches  at  different  levels 
providing  a  ‘defense  in  depth’.  The  first  level  is  the  design  of  an  inherently  safe  plant 
with  an  adequate  control  system  which  can  operate  with  a  high  degree  of  reliability.  In 
addition,  special  emphasis  is  placed  on  the  quality  of  materials  and  workmanship  of 
components,  with  provision  for  continuous  or  periodic  inspection  of  components  and 
sub-systems.  The  careful  design  and  construction  is  followed  by  equally  careful 
operations  so  as  to  reduce  the  probability  of  accidents  to  a  very  low  level. 

At  a  second  level,  safety  is  provided  by  means  of  a  comprehensive  plant  protection 
system  (pps)  which  can  safely  handle  a  wide  range  of  conceivable  abnormal  incidents 
and  malfunctions  and  safely  shut  down  the  plant.  The  pps  includes  a  variety  of 
instruments  and  sensors  to  monitor  the  state  of  the  plant  and  to  take  protective  as  well 
as  sympathetic  control  actions  to  prevent  any  abnormality  leading  to  an  accident. 

The  third  level  of  safety  is  provided  by  engineered  safety  features  such  as  reactor 
containment  vessel  and  building,  secondary  shutdown  systems,  emergency  core  cooling 
system,  etc.,  which  limit  the  consequences  of  certain  highly  unlikely  accidents,  which  are 
postulated  to  occur  in  spite  of  the  first  and  second  level  safety  measures.  Finally,  a 
careful  choice  of  site  is  made,  and  exclusion  areas  defined  such  that  routine  as  well  as 
accidental  radioactive  releases  do  not  affect  the  general  population. 

In  the  following  discussion,  we  elaborate  on  these  elements  of  reactor  safety  with 
reference  to  the  fast  reactor.  The  discussion  will  start  with  physics,  control  and 
engineering  design  features  providing  the  inherent  safety  of  a  modern  fast  reactor,  then 
go  on  to  the  main  components  of  the  pps.  Finally,  the  principles  of  the  containment  of 
radioactivity  under  normal  and  accident  conditions  will  be  discussed. 
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The  40  MWt/13  MWe  loop  type  Fast  Breeder  Test  Reactor  (fbtr)  now  being 
commissioned  at  Kalpakkam  and  the  proposed  500  MWe  pool  type  Prototype  Fast 
Breeder  Reactor  (pfbr)  will  be  used  as  examples  during  the  discussion  and  their  safety 
features  will  be  highlighted  vis-a-vis  the  international  approaches  to  fast  reactor  safety. 


2.  The  fast  reactor 

2.1  Thermal  and  fast  chain  reactions 

The  relative  probabilities  of  the  production  of  neutrons  by  fission  in  a  reactor,  and  the 
loss  of  neutrons  by  other  nuclear  reactions  as  well  as  escape  from  the  reactor  govern  the 
possibility  of  maintaining  a  chain  reaction.  Of  all  the  naturally  occurring  isotopes  of 
elements,  only  U235  is  fissile  and  can  sustain  a  chain  reaction.  U235  has  an  abundance  of 
only  0-7%  in  natural  uranium,  the  rest  being  the  isotope  U238.  However,  the  U235 
content  can  be  artificially  increased  in  enrichment  plants  to  produce  enriched  uranium. 
In  order  to  sustain  a  chain  reaction  in  natural  uranium  or  low  enrichment  uranium,  it  is 
necessary  (see  for  example,  Glasstone  and  Edlund  1952)  to  reduce  the  energy  of  the 
fission  neutrons  by  means  of  scattering  collisions  with  suitable  light  moderator  atoms, 
till  the  neutrons  attain  low  energies  (around  025  eV)  at  which  they  are  in  thermal 
equilibrium  with  matter.  These  thermal  neutrons  are  then  allowed  to  cause  further 
fission  and  continue  the  chain  reaction. 

If  a  chain  reaction  is  to  be  maintained  in  a  reactor  without  a  moderator,  then  it  is 
necessary  to  use  high  enrichments  ranging  from  1 5  %  (for  large  reactors)  to  over  90  % 
(for  small  reactors).  Here  the  chain  reaction  is  maintained  by  the  fast  moving  fission 
neutrons.  There  are  several  advantages  of  the  fast  chain  reaction  over  the  thermal  one. 
The  first  is  that  fast  reactors  are  very  compact  on  account  of  the  absence  of  the  large 
bulk  of  moderator  which  is  required  in  thermal  reactors.  Fast  reactors  are  much  more 
compact  than  the  light  water  moderated  package  power  reactors  used  in  nuclear 
powered  sea-craft,  and  have  been  considered  for  space  craft  applications  (Hanson 
1984).  In  addition,  the  fast  chain  reaction  is  used  in  nuclear  explosives  applications  and 
in  pulsed  fast  reactors  providing  a  large  number  of  fast  neutrons  for  nuclear  research 
(Oakes  1968). 

However,  the  most  important  reason  for  the  large  scale  development  of  fast  reactors 
is  the  efficient  breeding  of  new  fissile  material,  plutonium  and  U233,  from  their 
respective  fertile  materials,  U238  and  thorium.  The  fertile  materials  are  several  hundred 
times  more  abundant  than  U235  and  can  be  converted  to  fissile  material  by  neutron 
absorption  in  a  reactor.  A  breeder  reactor  is  designed  such  that  it  generates  more  fissile 
material  from  fertile  material  than  the  fissile  material  it  consumes  for  energy 
production.  Without  breeder  reactors,  it  will  be  possible  to  use  only  a  small  fraction  of 
the  uranium  resources.  Hence  all  plans  for  the  long-term  utilisation  of  uranium  and 
thorium  resources  are  based  on  the  breeder  reactor.  While  there  are  designs  for  thermal 
breeder  reactors  using  the  Th/U233  fuel  cycle,  for  efficient  breeding  the  fast  chain 
reaction  is  the  best,  mainly  on  account  of  the  fact  that  the  number  of  neutrons  released 
per  absorption  in  fuel  is  higher  for  higher  neutron  energies. 

2.2  The  lmfbr 

There  are  different  kinds  of  fast  reactors  and  the  best  developed  is  the  Liquid  Metal 
cooled  Fast  Breeder  Reactor  (lmfbr).  In  this  type,  heat  is  generated  by  the  fission  chain 
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reaction  in  a  core  consisting  of  fuel  subassemblies  supported  vertically  on  a  grid  plate  in 
a  reactor  vessel.  Each  subassembly  is  a  bundle  of  fuel  pins  made  of  stainless  steel  clad 
tubes  of  small  diameter  (less  than  1  cm)  containing  the  enriched  fuel  pellets.  The  fertile 
material  in  the  fuel  as  well  as  on  top  and  bottom  of  the  fuel  (axial  blankets)  get 
converted  to  fissile  material  by  neutron  irradiation.  In  addition,  radially  surrounding 
the  core  are  blanket  subassemblies  containing  additional  fertile  material  for  conversion 
to  fissile  material. 

Water  is  a  neutron  moderator,  and  rapidly  decelerates  the  fast  moving  fission 
neutrons  to  less  than  the  energy  suitable  for  breeding,  hence  it  cannot  be  used  as  a 
coolant.  The  problem  of  removing  a  large  amount  of  heat  from  the  small  compact  core 
of  a  fast  reactor  by  a  coolant  which  does  not  reduce  the  neutron  energy  unacceptably  is 
solved  in  the  lmfbr  by  the  use  of  liquid  sodium.  This  coolant  has  excellent  heat  transfer 
characteristics  and  good  chemical  compatibility  with  fuel  and  structural  materials.  The 
reactivity  of  sodium  with  air  or  water  is  taken  care  of  by  providing  an  inert  cover  gas 
like  argon  over  free  sodium  surfaces.  In  early  small  experimental  lmfbrs,  mercury, 
potassium  and  a  eutectic  mixture  of  sodium  and  potassium  (NaK)  have  been  also  used 
as  coolants. 

The  sodium  coolant  is  pushed  by  pumps  through  the  core  and  gets  heated.  The  hot 
primary  sodium  is  radioactive  and  is  not  used  to  directly  produce  steam,  but  rather 
transfers  the  heat  to  secondary  sodium  through  intermediate  heat  exchangers  (ihx).  The 
non-radioactive  secondary  sodium  flows  through  sodium  heated  steam  generators  to 
produce  steam  which  drives  turbo-generators.  In  the  pool  type  layout,  the  whole 
primary  coolant  circuit  along  with  primary  pumps  and  ihxs  is  contained  within  the 
reactor  vessel  as  shown  in  figure  1 .  In  the  loop  type  layout,  the  primary  pumps  and 
ihxs  are  outside  the  reactor  vessel  (figure  2)  though  the  whole  radioactive  primary 
circuit  is  contained  within  the  reactor  containment  building.  Both  types  of  layout  are 
prevalent  though  they  have  differing  safety  implications. 

The  chain  reaction  is  usually  controlled  by  the  use  of  the  neutron  absorber  boron,  in 
the  form  of  boron  carbide  control  rods.  Fuel  loading  and  unloading  is  invariably  done 
under  shutdown  conditions,  from  the  top  of  the  reactor  vessel  by  means  of  fuelling 
machines  handling  one  subassembly  at  a  time. 
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Figure  1.  Schematic  diagram  of  a  pool  type  lmfbr. 
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Figure  2.  Schematic  diagram  of  a  loop  type  lmfbr. 


Table  1.  Some  characteristics  of  fbtr  and  pfbr. 


FBTR 

PFBR 

Reactor  thermal  power  (MWt) 

40 

1250 

Electrical  output  (MWe) 

13 

500 

Primary  circuit  concept 

Loop 

Pool 

Fuel  material 

Mixed  carbide 

Mixed  carbide  of 

of  Pu  and  U 

Pu  and  U 

Pin  outer  diameter  (mm) 

51 

8-8 

Active  core  height  (cm) 

32 

100 

Volume  of  active  core  (litres) 

46-7 

2739 

Maximum  neutron  flux  (ncm"2  sec-1) 

3-7  x  1015 

8-6  x  1015 

Effective  delayed  neutron  fraction 

0-0028 

00037 

Prompt  neutron  lifetime  (sec) 

2.3  x  10-7 

3.6  x  10-7 

Number  of  control  rods 

6 

12 

Inlet  sodium  temperature  (°C) 

380 

380 

Outlet  sodium  temperature  (°C) 

515 

530 

Number  of  primary  pumps 

2 

4 

The  main  characteristics  of  fbtr  and  of  pfbr  are  given  in  table  1 .  fbtr  is  a  mixed 
plutonium-uranium  carbide  fuelled  lmfbr  with  two  primary  loops,  pfbr  is  planned  to 
be  a  pool  type  lmfbr  using  either  mixed  carbide  or  mixed  oxide  fuel  and  incorporating 
state  of  the  art  safety  features. 


3.  Inherent  safety  features 

3.1  Neutron  population  and  multiplication  factor 

The  power  produced  in  a  nuclear  reactor  core  is  proportional  to  the  rate  of  fission 
which  is  proportional  to  the  product  of  the  fissile  atom  concentration  and  the  neutron 
population,  JV,  in  the  core.  The  neutron  population  level  arises  from  a  dynamic 
equilibrium  between  the  generation  of  neutrons  by  the  fission  of  fuel  atoms  and  the  loss 
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of  neutrons  by  absorption  in  fuel  and  other  atoms  and  by  leakage  from  the  core. 

A  fundamental  parameter  is  the  multiplication  factor,  /c,  which  is  the  ratio  of  the  rate 
of  generation  of  neutrons  to  the  rate  of  loss  of  neutrons.  The  fission  rate  and  reactor 
power  will  hence  continuously  increase,  decrease  or  be  constant  accordingly  as  k  is 
greater  than,  less  than  or  equal  to  unity  and  the  reactor  is  correspondingly  said  to  be 
super  critical,  subcritical  or  critical.  In  an  operating  reactor,  k  can  be  changed  within 
limits  by  movement  of  the  control  rods  which  changes  the  rate  of  absorption  of 
neutrons.  An  alternate  parameter  to  k  is  the  reactivity  p  defined  as,  p  =  (k  —  \)/k.  Study 
of  the  normal  and  abnormal  changes  to  the  core  reactivity  is  important  for  control  and 
safety. 

3.2  Prompt  neutron  lifetime  and  delayed  neutron  fraction 

The  prompt  neutron  lifetime,  /,  is  the  average  time  from  the  ‘birth’  of  a  neutron  in  the 
fission  of  a  fuel  atom  to  the  ‘death’  of  the  neutron  by  absorption  or  leakage.  In  this 
connection,  it  must  be  noted  that  over  95%  of  the  reactions  in  a  fast  reactor  are 
scattering  reactions  which  reduce  the  neutron  energy  and  prolong  the  neutron  life.  In  a 
fast  reactor,  /  is  of  the  order  of  1 0  ~ 7  sec  as  compared  to  of  the  order  of  1 0  ”  4  to  1 0  ” 3  sec 
for  thermal  reactors.  The  average  rate  of  loss  of  neutrons  is  N/l  and  the  rate  of 
production  is  kN/l.  Thus  the  growth  of  neutron  population,  for  constant  /c,  will  be 
exponential  with  a  time  constant  T  =  l/(k  —  1)  called  the  reactor  period. 

Normal  control  movements  make  (k  —  1)  of  the  order  of  10”4  to  10” 3  so  that  the 
period  would  be  of  the  order  of  10”  4  to  10” 3  sec  for  fast  reactors  and  of  the  order  of  a 
second  for  thermal  reactors.  These  are  quite  rapid  response  times  and  would  have  made 
reactor  control  problematic,  especially  in  the  case  of  fast  reactors.  However,  as 
explained  below,  the  presence  of  delayed  neutrons  increases  the  response  time  and 
makes  it  the  same  for  both  fast  or  thermal  reactors. 

A  small  fraction  ft  (delayed  neutron  fraction)  of  the  neutrons  produced  in  the  fission 
of  a  fuel  atom  are,  in  fact,  emitted  subsequent  to  the  fission  by  decay  of  certain 
radioactive  fission  products  (delayed  neutron  precursors)  with  mean  lives  ranging  from 
03  sec  to  80  sec.  ft  is  0  00204  and  the  average  mean  life  x  is  14-6  sec  for  Pu239  and 
the  values  differ  for  different  nuclides  (Keepin  1965).  When  delayed  neutrons  are 
included,  the  reactor  period  can  be  obtained,  when  k  is  near  unity,  by  replacing  the 
prompt  neutron  lifetime  l  by  a  weighted  mean  neutron  lifetime  /*  =  (1  —  /?)/  +  fir.  I*  is 
of  the  order  of 005  sec  and  hence  for  (k  —  1 )  of  the  order  ofl0~4tol0”3  sec,  a  period  of 
the  order  of  50  to  500  sec  will  result. 

There  are  two  more  important  effects  connected  with  the  delayed  neutrons: 

(i)  When  a  reactor  is  shut  down  by  insertion  of  control  rods  such  that  k  is  reduced  much 
below  1 ,  the  power  at  first  falls  rapidly  to  a  fraction  fi/(\  —  k  +  ft)  of  the  operating  power, 
then  falls  gradually  towards  zero  with  a  period  close  to  the  mean  life  of  the  longest  lived 
delayed  neutron  precursor,  viz  80  sec.  Thus  it  is  not  possible  to  completely  shut  off  the 
chain  reaction  immediately. 

(ii)  When  p  is  greater  than  0  by  an  amount  /?,  then  the  neutron  population  grows  at  a 
rate  governed  by  the  extremely  short  prompt  neutron  lifetime  /  and  not  by  the  weighted 
mean  neutron  lifetime  /*.  The  reactor  under  such  conditions  is  said  to  be  super  prompt 
critical  and  very  rapid  rates  of  power  increase  become  possible.  An  important  safety 
consideration  is  to  protect  against  the  possibility  of  a  reactor  becoming  super  prompt 
critical.  On  account  of  the  importance  of  the  delayed  neutron  fraction  on  the  kinetic 
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behaviour  of  the  chain  reaction,  p  is  often  measured  in  terms  of  ft.  An  amount  of 
reactivity  equal  to  ft  is  called  a  ‘dollar  of  reactivity’  which  makes  the  reactor  super 
prompt  critical. 

3.3  Feed-back  coefficients  of  reactivity 

One  of  the  important  control  and  safety  features  of  a  nuclear  reactor  core  is  that  the 
reactivity  is  dependent  on  the  core  temperature.  As  seen  above,  when  k  is  changed  from 
1,  by  say  control  rod  movement,  the  neutron  population  changes  and  consequently  the 
power  P  changes.  This  leads  to  changes  in  the  core  temperature  T  which,  in  turn,  affect  k 
and  thereby  the  power.  Thus,  we  have  a  feed-back  loop. 

When  the  feed-back  change  in  k  opposes  the  applied  change  in  k,  then  the  reactor  has 
a  negative  feed-back  coefficient.  In  other  words,  any  addition  of  reactivity  increases 
power  which  increases  temperature,  and  which  in  turn  decreases  the  reactivity.  Thus, 
negative  feed-back  coefficients  lead  to  stability  of  operation  and  add  to  the  inherent 
safety  of  the  core.  On  the  other  hand,  a  positive  feed-back  coefficient  can  lead  to 
instability  and  power  runaways.  It  is  to  be  noted  that  the  core  temperature  can  also  be 
changed  by  changes  in  the  coolant  flow  rate  or  the  coolant  inlet  temperature.  Designing 
the  reactor  core  for  all  negative  feed-back  coefficients  of  reactivity  is  an  important 
element  in  the  overall  safety  of  the  reactor. 

Different  mechanisms  with  differing  response  times  of  action  contribute  to  the  feed¬ 
back.  The  response  times  for  the  feed-back  depend  on  the  time  taken  for  the 
temperatures  of  the  various  core  components  to  change.  These  times  are  obtained  from 
heat  transfer  equations  relating  the  component  temperatures  to  the  power,  coolant  flow 
rate  and  the  coolant  inlet  temperature. 

The  precise  features  of  the  feed-back  mechanisms  vary  from  reactor  type  to  reactor 
type.  In  a  fast  reactor,  the  three  important  mechanisms  of  reactivity  feedback  under 
normal  operating  conditions  are  the  Doppler  coefficient,  expansion  coefficients  and 
differential  expansion  coefficients.  Additional  mechanisms  of  reactivity  feed-back 
under  accident  conditions  are  by  coolant  voiding  and  fuel  slumping. 

3.3 a  Doppler  coefficient  of  reactivity:  The  probability  of  a  neutron  being  absorbed  by 
or  causing  fission  of  an  atom  varies  with  the  energy  of  the  neutron  and  in  a  range  of 
energies  (0  to  20  keV),  there  are  sharp  peaks  in  these  probabilities  called  resonances. 
There  is  a  basic  theorem  (see  for  example,  Hummel  and  Okrent  1970)  of  reactor  physics 
that  the  rate  of  absorption  of  or  fission  by  neutrons  in  the  resonance  energy  range 
increases  if  the  atoms  are  themselves  in  motion.  Thus,  when  the  temperature  of  the 
reactor  core  materials  increases,  the  increased  thermal  motion  of  the  atoms  causes  the 
reaction  rates  to  increase.  Consequently,  the  core  reactivity  changes  giving  rise  to  the 
Doppler  coefficient  of  reactivity. 

For  fertile  materials  U238  or  thorium,  the  predominant  reaction  is  the  absorption  of 
neutrons  and  hence  the  Doppler  effect  in  these  materials  increases  the  absorption  of 
neutrons  and  reduces  the  core  reactivity,  i.e.  the  feed-back  is  negative.  On  the  other 
hand,  for  fissile  materials  both  the  absorption  and  fission  rates  are  increased.' The 
increased  loss  of  neutrons  by  absorption  is  compensated  by  the  increased  production  of 
neutrons  by  fission  so  that  the  net  result  of  the  Doppler  effect  in  fissile  materials  is  very 
small.  The  total  Doppler  coefficient  of  the  reactor  is  the  sum  of  the  different 
contributions  of  the  fissile  and  fertile  nuclides  in  the  fuel,  with  the  contribution  from  the 
fertile  material  being  dominant. 
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The  following  are  the  important  characteristics  of  this  coefficient: 

(i)  It  is  a  prompt  coefficient  related  to  the  fuel  temperature,  which  temperature  is  the 
first  to  increase  when  the  power  increases. 

(ii)  In  a  small  fast  reactor  with  high  enrichment,  the  neutrons  have  a  short  lifetime  as 
they  are  quickly  absorbed  in  the  highly  enriched  fuel  or  quickly  leak  out  of  the  small 
reactor.  Hence  the  average  neutron  energy  is  high,  close  to  that  of  the  fission  neutrons. 
The  resonance  energy  region  is  about  100  times  lower  than  the  fission  neutron  energies 
and  in  such  small  reactors,  there  are  very  few  resonance  energy  neutrons  so  that  the 
Doppler  effect  is  negligible.  Thus,  for  example,  there  is  a  negligible  Doppler  coefficient 
in  the  fbtr. 

(iii)  In  larger  reactors  of  lower  fuel  enrichment  and  smaller  neutron  leakage,  the 
neutrons  have  a  longer  life  so  that  an  appreciable  number  of  neutrons  slow  down  into 
the  resonance  energy  region.  The  Doppler  coefficient  is  thus  large  and  is  important  for 
the  stability  and  safety  of  the  reactor.  The  coefficient  is  largest  in  oxide  fuelled  fast 
reactors  which  have  low  average  neutron  energies  on  account  of  the  presence  of  two 
moderating  atoms  (oxygen)  per  atom  of  fuel. 

(iv)  The  calculation  of  the  Doppler  coefficient  in  fast  reactors  is  difficult  on  account  of 
the  small  fraction  of  neutrons  entering  the  resonance  energy  range  from  the  fission 
neutron  energies.  Detailed  calculations  with  accurate  nuclear  data  are  required 
(Nicholson  and  Fischer  1968)  to  establish  correctly  the  number  density  of  neutrons  in 
the  resonance  energy  range.  This  small  fraction  of  neutrons  contribute  little  to  the 
overall  neutron  balance,  but  they  are  the  main  cause  of  the  Doppler  reactivity  effect. 
Many  years  of  worldwide  effort  have  resulted  in  an  ability  to  calculate  this  coefficient 
with  an  accuracy  of  about  20%  (Ganesan  1979). 

(v)  The  increase  in  reaction  rates,  as  the  thermal  motion  of  the  target  atoms  increases, 
tends  to  get  saturated  at  higher  temperatures.  The  Doppler  coefficient  hence  decreases 
as  the  temperature  increases.  Commonly,  a  fitted  linear  decrease  is  used,  so  that  7d/c/dT 
will  be  a  constant  called  the  Doppler  constant  which  for  the  larger  fast  reactors  varies 
from  —0  001  to  —0  01  according  to  the  fuel  and  size  of  the  reactor.  For  the  carbide 
fuelled  version  of  pfbr,  the  Doppler  constant  is  calculated  to  be  —  0  0064  for  normal 
operating  conditions  and  —  0  0046  under  coolant  voided  conditions. 

(vi)  A  very  important  series  of  experiments  in  connection  with  the  Doppler  coefficient 
were  conducted  in  1972  in  the  sefor  lmfbr  of  usa  (Harris  1973).  This  was  a  500  litre 
oxide  fuelled  lmfbr  with  neutron  spectrum  and  fuel  temperatures  representative  of 
large  lmfbr  s.  The  experimental  programme  was  designed  to  demonstrate  that  a  prompt 
negative  Doppler  coefficient  of  sufficient  magnitude  makes  the  lmfbr  an  extremely 
stable  and  inherently  safe  reactor  system.  The  super  prompt  critical  transient  tests 
involving  reactivity  insertion  of  upto  1*3$  in  0T  sec  provided  a  convincing  demonstra¬ 
tion  of  the  effectiveness  of  the  Doppler  coefficient  in  limiting  the  energy  release  in  a  fast 
reactor  power  excursion. 


3.3 b  Expansion  coefficients  of  reactivity :  A  rise  in  temperature  causes  the  different 
materials  in  the  fast  reactor  core  to  expand  thereby  reducing  the  number  of  atoms  per 
unit  volume.  Reaction  rates  depend  on  the  product  of  neutron  density  and  the  number 
density  of  atoms,  and  so  all  reaction  rates  decrease  and  the  fraction  of  neutrons  leaking 
out  of  the  core  increases.  The  effect  on  the  reactivity  varies  according  to  the  type  of 
material  suffering  expansion. 
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Fuel  density  reduction  leads  to  loss  of  reactivity  on  account  of  reduced  fuel-neutron 
interactions  and  increased  neutron  leakage.  On  account  of  the  gap  between  fuel  pellets 
and  the  clad  inner  wall,  the  radial  fuel  density  reduction  is  in  fact  governed  by  the  radial 
expansion  of  the  structural  steel.  The  fuel  pellets’  axial  expansion  is  governed  by  the  fuel 
temperature  and  is  a  fast  acting  coefficient.  In  small  fast  reactors,  the  Doppler 
coefficient  is  negligible  and  so  the  radial  and  axial  fuel  expansion  coefficients  assume 
importance.  The  axial  fuel  expansion  coefficient  for  ceramic  fuel  changes  as  the  fuel 
structure  changes  due  to  irradiation,  and  the  study  of  its  behaviour  with  burn-up  of  the 
fuel  is  important.  Much  useful  information  on  the  behaviour  of  this  coefficient  with 
burn-up  of  mixed  plutonium-uranium-oxide  fuel  has  been  obtained  from  the  operation 
of  the  French  reactors  rapsodie  and  phenix  (Berthet  et  al  1982;  Clauzon  et  al  1976). 

The  expansion  of  the  sodium  coolant  has  several  opposite  effects  on  the  reactivity: 

(i)  Reduction  in  the  absorption  of  neutrons  by  sodium  leading  to  a  small  gain  in 
reactivity. 

(ii)  Reduction  in  the  neutron  moderating  scattering  collisions  with  sodium  atoms. 
Hence,  the  average  energy  of  neutrons  in  the  reactor  increases.  In  fast  reactors,  when  the 
neutron  energy  increases  the  number  of  neutrons  released  per  neutron  absorption  in 
the  fuel  increases,  and  the  parasitic  neutron  absorption  in  structural  and  coolant 
materials  decreases.  Thus,  the  reactivity  of  the  core  increases. 

(iii)  Increase  in  the  leakage  of  neutrons  from  the  core  on  account  of  reduced  sodium- 
neutron  scattering  interactions,  leading  to  a  loss  of  reactivity. 

In  small  reactors  like  fbtr,  the  third  effect  is  dominant  so  that  the  sodium  expansion 
coefficient  is  negative.  In  larger  reactors  like  pfbr,  the  third  effect  is  small  compared  to 
the  second  one  so  that  the  coefficient  becomes  positive. 

The  expansion  of  steel  material  in  the  core  causes  effects  similar  to  that  of  sodium, 
though  in  this  case,  as  steel  is  composed  of  atoms  much  heavier  than  sodium,  the  effect 
due  to  change  in  neutron  moderation  is  small  and  those  due  to  change  in  leakage  and 
absorption  dominate.  This  expansion  coefficient  is  negative  in  small  reactors  and 
becomes  positive  in  large  ones. 

It  is  difficult  to  distinguish  experimentally  the  differing  speeds  of  response  of  the 
expansion  coefficients  associated  with  the  fuel,  structure  and  coolant  in  a  fast  reactor 
core  and  a  total  expansion  coefficient  with  a  single  time  constant  of  response  is  found 
adequate  to  describe  the  kinetic  behaviour.  This  total  expansion  coefficient  is  always 
negative  and  has  a  response  time  of  1  to  5  sec.  In  fbtr,  the  total  expansion  coefficient  has 
a  value  of  about  —  4  x  10"  5/°C  with  a  response  time  of  about  2  sec.  These  expansion 
coefficients  can  be  relied  upon  and  are  important  for  turning  round  slow  reactivity 
increases  and  preventing  system  upsets  leading  to  unsafe  conditions. 


3.3c  Differential  expansion  coefficients:  On  account  of  leakage  of  neutrons,  the  neutron 
density  and  consequently  the  power  is  greater  towards  the  centre  of  the  reactor  than 
towards  the  boundary.  Further,  the  coolant  enters  at  the  bottom  of  the  core,  gets  heated 
and  leaves  at  the  top.  Thus,  there  are  thermal  gradients  across  the  core  and  changes  in 
these  gradients  can  cause  reactivity  changes.  The  major  effects  are  radial  bowing  of  the 
fuel  subassemblies  governed  by  the  radial  temperature  gradient  changes;  and  relative 
movement  between  the  control  rods  and  core  governed  by  the  temperature  difference 
between  the  hot  sodium  at  the  top  in  which  the  control  rod  shafts  dip,  and  the  cooler 
sodium  at  the  bottom  around  the  core  support  structures. 


202 


C  V  Sundaram  and  S  M  Lee 


Inward  bowing  of  the  fuel  subassemblies  leads  to  an  increase  of  density  of  core 
materials  in  the  core  centre,  where  the  neutron  density  is  higher.  This  leads  to  an 
increase  in  reactivity.  Such  a  positive  bowing  coefficient  was  present  in  the  early  fast 
reactor  ebr-1  and  led  to  a  core  meltdown  accident  (Thompson  and  Beckerley  1964) 
during  special  experiments  for  measuring  the  feed-back  coefficients.  Modern  designs 
for  lmfbr’s  incorporate  spacer  pads  at  suitable  levels  between  fuel  subassemblies  to 
make  the  bowing  coefficients  zero  or  slightly  negative.  Thus  fbtr  has  a  slightly  negative 
bowing  coefficient  and  pfbr  would  be  also  designed  to  have  a  negative  one. 

The  reactivity  coefficient  associated  with  control  rod  shafts  expansion  is  a  fairly  large 
negative  one  but  acts  after  a  time  lag.  In  the  case  of  fbtr,  it  is  expected  to  have  a  value  of 
—  2*6  x  10-5/°C  with  a  response  time  of  25  sec. 

3.3d  Sodium  void  coefficient :  Exactly  similar  arguments  as  for  the  reactivity  change 
during  the  expansion  of  sodium  show  that  there  will  be  a  reactivity  change  if  the  sodium 
coolant  is  accidentally  voided  from  the  core.  This  is  called  the  sodium  void  coefficient 
and  is  negative  for  small  cores  like  fbtr  and  positive  ( 4-  3  dollars)  for  large  cores  like 
pfbr.  Obviously,  there  are  strong  safety  implications  associated  with  the  positive 
coolant  voiding  coefficient,  whose  value  can  attain  several  dollars  for  a  large  power 
reactor.  Comprehensive  theoretical  and  experimental  investigations  for  the  prediction 
of  this  coefficient  have  been  made  leading  to  a  satisfactory  understanding  (Kiisters  and 
Ganesan  1978).  Currently,  the  successful  designs  to  have  a  lower  sodium  void 
coefficient  are  the  so  called  heterogeneous  cores  in  which  internal  blanket  material 
zones  are  placed  within  the  core  (see  for  example,  Inoue  et  al  1982).  In  this  case,  voiding 
leads  to  neutron  leakage  into  the  internal  blanket  zones  thus  reducing  reactivity. 
However,  no  heterogeneous  core  fast  reactor  has  been  operated.  The  alternate 
approach  has  been  to  maintain  a  positive  voiding  coefficient  and  ensure  by  proper 
safeguards  and  design  that  the  sodium  cannot  be  voided  suddenly.  This  has  been  found 
particularly  easy  for  pool  type  reactors  where  the  possibility  of  loss  of  sodium  by  pipe 
break  is  not  possible  and  where  due  to  thermal  inertia,  there  is  considerable  time  lag  for 
the  sodium  to  boil  when  there  is  a  loss  of  flow.  Thus,  for  example,  both  for  the  250  M  We 
PHENixand  the  1200  MWe  superphenix  reactors,  a  loss  of  electric  supply  to  the  primary 
and  secondary  pumps  followed  by  a  failure  to  shut  down  the  reactor  does  not  result  in 
sodium  boiling  (Gouriou  et  al  1982). 

3.3c  Fuel  slumping  reactivity  gain:  Unlike  thermal  reactors,  in  a  fast  reactor  the 
spacing  between  the  fuel  pins  is  governed  by  heat  removal  considerations  and  not  from 
reactivity  considerations.  Compaction  of  fuel  tends  to  reduce  the  neutron  leakage  and 
increase  the  reactivity.  Hence,  fuel  melting  and  slumping  is  the  most  serious  accident  in 
a  fast  reactor.  Traditionally,  the  worst  rates  of  reactivity  addition  were  calculated  from 
a  gravitational  collapse  of  the  core  assuming  gross  core  melting  due  to  loss  of  flow  of 
coolant.  The  total  reactivity  addition  can  be  several  tens  of  dollars  and  the  rate  of 
reactivity  addition  can  be  of  the  order  of  a  hundred  dollars  per  second.  The  smaller  the 
reactor,  the  greater  is  the  reactivity  gain  due  to  slumping.  It  is  to  be  noted  that  the  values 
for  the  slumping  reactivity  gain  and  the  rate  of  reactivity  addition  given  above  are 
extreme  theoretical  values  and  realistic  modelling  of  accident  sequences  does  not  give 
such  values.  However,  it  was  these  high  values  which  were  the  early  causes  for  concern 
and  gave  the  impetus  for  extensive  fast  reactor  safety  research. 
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3.4  Engineering  features 

There  are  many  engineering  design  features  which  enhance  the  inherent  safety  of  the 
lmfbr.  An  important  feature  is  that  the  sodium  system  is  a  low  pressure  system  and  the 
maximum  sodium  temperature  (~  550°C)  is  far  below  the  sodium  boiling  point. 
Further,  sodium  has  extremely  good  heat  transfer  and  natural  convection  properties 
which  ensure  good  decay  heat  removal  even  under  accident  conditions.  General  design 
features  enhancing  the  safety  are  (Graham  1971): 

(i)  use  of  special  engineering  design  codes  for  fast  reactors  with  appropriate  safety 
factors, 

(ii)  limited  rates  of  control  rod  withdrawal, 

(iii)  use  of  diverse  and  redundant  shutdown  systems, 

(iv)  provision  of  flywheel  action  in  main  sodium  pumps  to  maintain  flow  under  loss  of 
power, 

(v)  provision  of  alternate  grid  supplies, 

(vi)  use  of  multiple  radial  inlets  to  subassemblies  to  avoid  flow  blockages, 

(vii)  use  of  wrapped  and  restrained  piping  to  avoid  pipe  breakage;  use  of  safety  guard 
tanks  around  components  like  reactor  vessel,  ihx,  pumps,  etc. 

(viii)  use  of  double  walled  piping,  check  valves,  hydraulic  diodes,  reserve  coolant 
volumes,  etc. 

The  design  features  providing  inherent  safety  in  fbtr  are  diverse.  The  coolant  entry 
to  subassemblies  is  via  multiple  (twelve)  openings  which  cannot  get  blocked 
simultaneously — even  with  50%  blockage  the  flow  reduction  is  only  10%.  Hold-down 
springs  are  provided  at  the  feet  of  subassemblies  even  though  the  sodium  fluid  dynamic 
forces  are  insufficient  to  lift  a  subassembly.  In  addition  to  flywheels  which  are  provided 
On  pumps  to  prevent  abrupt  flow  stoppages,  diverse  and  independent  power  supplies 
feed  the  pumps  including  emergency  batteries  and  diesel  generators.  The  general  safety 
features  which  would  be  incorporated  in  pfbr  include  a  pool  type  design  using  a  large 
double  walled  vessel  to  house  the  reactor.  Multiple  inlets  for  sodium  entry  into 
subassemblies  are  to  be  provided.  Emergency  power  supplies,  flywheels  on  pumps  and 
two  independent  shutdown  systems  will  be  provided. 


4.  The  plant  protection  system 

4.1  Elements  of  the  plant  protection  system 

Despite  the  assurance  offered  by  careful  design,  construction  and  operation,  the  second 
and  most  important  level  of  safety  is  the  pps,  which  can  handle  a  wide  range  of 
conceivable  abnormal  incidents  and  malfunctions  and  safely  shut  down  the  reactor. 
Basically  the  system  consists  of: 

(i)  a  wide  variety  of  instruments  to  monitor  the  plant  operating  parameters  and 
characteristics, 

(ii)  assured  shutdown  systems  triggered  by  signals  from  the  monitors, 

(iii)  decay  heat  removal  systems. 

Core  damage  may  occur  if  the  shutdown  systems  or  the  decay  heat  removal  systems 
fail  on  demand.  Hence  proper  operation  of  the  pps  is  very  important.  This  is  assured  by 
means  of  redundancy,  diversity  and  fail-safe  operation  of  the  components  of  the  pps. 
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Redundancy  refers  to  the  use  of  two  or  more  similar  components  in  parallel  by  means 
of  which  it  is  assured  that  the  pps  does  not  fail  when  needed,  and  at  the  same  time,  false 
plant  shut-downs  are  not  caused  by  instrument  errors.  Diversity  in  shutdown  systems, 
electric  power  systems,  etc.  is  provided  to  prevent  common  mode  failure,  i.e.  a  number  of 
failures  resulting  from  a  single  cause.  By  fail-safe  operation  is  meant  that  failure  of  a 
component  of  the  pps  results  in  shutdown  of  the  plant  rather  than  continued 
unprotected  operation.  Recent  designs,  like  the  snr  300  and  superphenix,  provide  for 
two  independent  plant  protection  systems  (Birkhofer  et  al  1982;  Berlin  et  al  1982). 

The  pps  often  includes  a  process  computer  which  monitors  and  processes  the 
information  of  all  signals  relevant  to  the  safety  of  the  plant.  In  fbtr,  this  is  performed  by 
the  Central  Data  Processing  System  (a  pair  of  eciltdc-3  16  computers)  which  can  also 
initiate  alarms  and  safety  actions. 

4.2  Plant  monitoring  systems 

The  most  important  monitoring  systems  in  a  fast  reactor  plant  are  for  neutron  and; 
sodium  temperature  and  flow  monitoring;  and  clad  rupture  detection.  Other  monitor¬ 
ing  systems  are  for  sodium  levels  and  sodium  leaks  and  aerosol  monitoring;  and  steam 
generator  leak  detection. 

4.2 a  Neutron  monitoring:  Neutron  detectors  and  associated  electronics  are  required  in 
order  to  monitor  and  control  the  power  level  as  well  as  to  provide  signals  for  safety 
action.  The  neutron  monitoring  system  provides  signals  for  alarms,  power  reduction  or 
reactor  shutdown  when  the  neutron  density  level  is  too  high  or  the  rate  of  increase  of 
neutron  density  is  too  high.  In  addition,  most  lmfbr  s  contain  reactivity  meters  which 
use  measured  neutron  density  as  a  function  of  time  to  derive  the  reactivity  changes, 
which  can  also  signal  for  safety  action  if  found  excessive.  The  neutron  monitoring 
system  is  triplicated  and  safety  action  is  taken  when  any  two  of  the  three  monitors  call 
for  it. 

On  account  of  high  gamma  radiation  fields,  all  neutron  detectors  should  discriminate 
against  gammas.  Both  for  fbtr  and  pfbr,  the  neutron  detectors  considered  are  BF3 
counters,  boron  coated  counters,  fission  counters  and  boron  coated  compensated  ion 
chambers.  Location  of  the  detectors  outside  the  primary  vessel  is  desirable  from 
considerations  of  temperature  and  radiation.  In  fbtr,  the  detectors  are  located  at  the 
start  of  the  radial  concrete  shielding  outside  the  reactor  vessel.  For  pfbr,  location  of 
detectors  under  the  main  vessel  is  under  consideration.  Further,  for  a  pool  type  reactor 
like  pfbr,  adequate  neutron  levels  are  not  obtained  outside  the  main  vessel  for 
shutdown  monitoring  and  use  of  in-vessel  detectors  is  to  be  considered. 

4.2 b  Sodium  temperature  and  flow  monitoring:  Monitoring  of  sodium  temperatures 
and  early  detection  of  sodium  boiling  is  very  important  for  core  safety  especially  with 
positive  coolant  voiding  coefficients.  Most  commonly,  thermocouples  (Chromel- 
Alumel  or  Ni-Cr)  and  Resistance  Temperature  Detectors  (rtd)  are  used.  In  fbtr,  each 
fuel  subassembly  is  monitored  by  2  thermocouples  and  a  similar  system  may  be 
considered  for  pfbr. 

Sodium  flows  are  measured  both  by  Venturi  flow  meters  and  magnetic  flow  meters. 
While  the  former  are  very  accurate,  their  response  time  is  too  slow  for  use  in  the  pps, 
which  purpose  is  served  by  the  latter. 
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Trip  signals  are  actuated  by  high  sodium  temperature,  low  flows,  mismatch  of  power 
to  flow  values,  loss  of  electric  power  supply,  etc. 

The  onset  of  sodium  boiling  is  also  sought  to  be  detected  in  modern  lmfbr  s  by  the 
methods  of  neutron  noise  and  acoustic  noise  detection.  Bubbles  formed  when  sodium 
boils  give  rise  to  both  reactivity  fluctuations  (see  the  discussion  in  §3.3b)  as  well  as  to 
pressure  variations  in  the  metal.  These  fluctuations  can  be  recorded  by  neutron 
monitors  or  by  pressure  transducers  and  the  onset  of  boiling  detected. 

4.2c  Clad  rupture  detection:  Detection,  localisation  and  removal  of  failed  fuel  with 
ruptured  clad  is  required  to  prevent  excessive  contamination  of  the  primary  coolant 
and  to  prevent  flow  blockages  that  could  arise  on  account  of  gradual  fuel  swelling  when 
sodium  contacts  the  fuel.  Both  in  fbtr  and  in  pfbr,  the  methods  of  failed  fuel  detection 
are  to  be  by  monitoring  of  the  cover  gas  fission  product  activity  and  by  monitoring  of 
delayed  neutrons  in  the- primary  coolant.  Excessive  activities  from  these  monitors 
trigger  a  plant  shutdown. 

The  cover  gas  fission  product  activity  is  due  to  gaseous  fission  products  like  Xe  and 
Kr  escaping  from  ruptured  fuel  pins.  These  gases  require  time  to  reach  the  cover  gas 
plenum  and  the  response  time  for  clad  rupture  detection  by  cover  gas  monitoring  is  of 
the  order  of  minutes.  On  the  other  hand,  fission  products  like  I137  and  Br87  which  are 
delayed  neutron  precursors  get  dissolved  in  the  sodium  upon  clad  rupture  and  the 
emitted  neutrons  are  detected  in  a  bypass  loop  of  the  primary  circuit.  The  response  time 
of  this  system  is  a  function  of  the  coolant  transport  time  and  is  in  the  range  of  20-30  sec. 
Identification  of  the  failed  fuel  subassembly  is  an  important  problem  and  one  of  the 
methods  followed  in  some  reactors  is  to  sample  the  sodium  from  each  subassembly  by 
selector  valves  and  monitor  the  delayed  neutrons. 

4.3  Reactor  shutdown  systems 

Besides  the  function  of  reactivity  or  power  adjustment,  the  control  rods  must  be  able  to 
shut  down  the  reactor  under  any  foreseen  conditions.  It  is  important  to  assess  the 
mechanisms  of  accidental  reactivity  insertions  and  ensure  that  the  shutdown  system  can 
compensate  for  these.  Such  mechanisms  include  melting  of  a  fuel  subassembly,  sodium 
voiding,  inadvertent  withdrawal  of  a  control  rod,  introduction  of  moderator,  sudden 
flow  increases,  etc. 

In  fbtr,  the  reactor  control  is  by  six  enriched  boron  carbide  control  rods.  The  power 
to  drive  the  rods  is  supplied  independently  to  each  of  the  six  control  rod  drive 
mechanisms  and  in  the  event  of  power  failure  or  a  rapid  shutdown  trip  action  the  drive 
mechanisms  and  rods  fall  under  gravity  in  about  400  m  sec  to  shut  down  the  reactor. 
The  electrical  connections  are  such  that  only  one  rod  can  be  withdrawn  at  a  time  at  a 
rate  of  1  mm/sec  which  limits  the  reactivity  addition  rate  to  a  low  6  x  105/sec. 

As  in  modern  lmfbrs,  pfbr  will  have  two  independent  shutdown  systems.  The 
primary  shutdown  system  is  expected  to  consist  of  9  control  rods  to  be  used  for  normal 
operations  as  well  as  for  rapid  shutdown.  In  addition,  a  completely  independent 
secondary  shutdown  system  of  3  control  rods  based  on  a  different  design  principle 
would  be  able  to  override  abnormal  reactivity  additions  and  shut  off  the  reactor  even  if 
all  the  primary  rods  fail  to  act.  Their  design  would  be  such  that  they  can  enter  even 
distorted  cores,  and  would  automatically  operate  on  sodium  overheating  due  to  Curie 
point  demagnetisation  of  magnets.  Alternate  design  principles  for  the  secondary  system 
under  consideration  are: 
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(i)  Articulated  links  with  spherical  couplings  (Pignatelli  et  al  1982). 

(ii)  Hydraulically  suspended  absorber  balls  (Specht  et  al  1976). 

Fuel  loading  and  unloading  in  an  lmfbr  is  done  with  all  control  rods  fully  inserted 
and  disconnected  from  their  drive  mechanisms.  It  is  important  to  ensure  that  the  core  is 
not  made  critical  during  refuelling.  Hence,  accurate  measurement  of  the  degree  of 
subcriticality  of  the  shutdown  core  and  the  reactivity  worths  of  the  loaded  sub- 
assemblies  is  important. 


4.4  Decay  heat  removal 

The  decay  heat  is  initially  about  6  to  7  %  of  the  reactor  power  and  gradually  falls  with 
time  becoming  around  1  %  after  an  hour.  An  important  part  of  the  pps  is  the  provision 
of  adequate  decay  heat  cooling  under  shutdown  conditions  for  as  long  as  necessary  (up 
to  a  month  or  more)  even  under  conditions  of  coolant  system  failures. 

Decay  heat  removal  in  fast  power  reactors  is  ensured  by  a  diversity  of  means  (Lauret 
et  al  1982;  Agarwal  and  Guppy  1981): 

(i)  multiplicity  of  primary  heat  transport  loops  ensuring  normal  decay  heat  removal  by 
the  operation  of  a  single  loop  alone, 

(ii)  redundancy  and  diversity  in  power  supplies  to  pumps, 

(iii)  forced  circulation  in  the  immediate  period  following  shutdown  (when  decay  heat 
generation  rates  are  high)  by  design  provisions,  such  as  flywheels  for  gradual  coastdown 
of  pumps, 

(iv)  continued  forced  circulation  by  low  speed  operation  of  pumps  by  means  of  pony 
motors  backed  by  alternate  power  supplies/emergency  diesel  generator  sets/batteries, 

(v)  provision  of  independent  back-up  decay  heat  removal  systems, 

(vi)  design  of  normal  as  well  as  back-up  heat  removal  systems,  such  that  setting-up  of 
good  heat  removal  by  natural  convection  is  facilitated. 

In  pool  type  lmfbr  s,  the  large  thermal  capacity  of  the  sodium  bulk  acts  as  a  heat  sink 
and  allows  a  long  time  interval  (at  least  several  hours)  for  alternate  measures  to  be  taken 
before  the  sodium  boils.  The  possibility  of  complete  loss  of  sodium  from  the  primary 
heat  removal  circuits  is  also  not  present  in  such  reactors. 

For  fbtr,  decay  heat  removal  is  ensured  by  a  variety  of  measures  starting  with  the 
operation  control  that  the  reactor  will  be  operated  only  with  both  heat  transport  loops. 
Thus  one  loop  will  always  be  available  to  remove  decay  heat  under  upset  conditions. 
Further,  emergency  batteries  and  diesel  generators  provide  power  for  the  pumps  on 
grid  non-availability.  Natural  convection  is  found  to  be  able  to  remove  about  300  kW  of 
heat  with  none  of  the  pumps  operational.  In  case  the  primary  circuit  is  not  available  due 
to  sodium  leak,  decay  heat  removal  is  by  the  emergency  cooling  system,  where  cooling 
nitrogen  is  circulated  in  the  double  envelope  of  the  reactor  vessel.  Further,  flooding 
tanks  full  of  sodium  have  been  provided  to  dump  sodium  into  the  reactor  vessel  and 
maintain  the  sodium  level  in  case  of  a  primary  circuit  leak.  Finally,  the  biological  shield 
cooling  system  can  also  remove  decay  heat.  These  provisions  are  considered  adequate 
to  provide  decay  heat  cooling  for  any  upset  conditions. 

In  the  case  of  pfbr,  there  are  to  be  four  independent  loops  and  normal  decay  heat 
removal  is  ensured  when  any  one  of  the  loops  is  operational.  Physical  separation  of  the 
loops  will  be  provided  to  avoid  common  failures.  To  provide  forced  cooling  following 
shutdown,  design  provisions  are  made:  flywheel  inertia  for  appropriate  pumps  coast- 
down,  and  pony  motors  backed  by  alternate  power  supplies/diesel  generators/batteries. 


Fast  reactor  safety 


207 


Setting-up  of  natural  convection  for  heat  removal  would  be  ensured  by  suitable 
elevations  of  core,  ihx  and  terminal  sodium/air  heat  exchangers.  It  is  estimated  that  two 
loops  in  natural  convection  could  provide  the  required  decay  heat  removal  capacity.  To 
provide  cooling  in  case  of  common  mode  failure  resulting  in  non-availability  of  all 
normal  loops,  an  independent  back-up  decay  heat  removal  system  would  be  provided. 


5.  Principles  of  containment 

5.1  Containment  barriers 

An  operating  commercial-sized  fast  breeder  reactor  would  have  an  inventory  of  a  few 
thousand  kilograms  of  plutonium  and  a  few  hundred  kilograms  of  fission  products. 
The  total  radioactivity  is  in  millions  of  curies.  As  far  as  environmental  effects  are 
concerned,  the  main  nuclides  of  importance  are  the  plutonium  isotopes,  Am241,  Cm244, 
tritium,  Kr85,  Sr90,  1129,  1131,  1133,  Xe133,  Cs134,  Cs137  and  the  activation  product  C14. 
The  potential  for  escape  varies  according  to  whether  the  radioactive  material  is  gaseous 
(like  Xe  or  Kr),  volatile  (like  I,  Br,  Cs,  Rb)  or  non-volatile.  The  release  of  radioactive 
material  to  the  environment  during  normal  operation  and  under  accident  conditions  is 
prevented  by  three  physical  barriers. 

The  first  barrier  is  the  cladded  fuel  element.  Non-volatile  fission  products  account  for 
some  98  %  of  the  generated  radioactivity  and  are  retained  in  the  solid  fuel  material.  The 
fuel  material  itself,  as  well  as  the  volatile  and  gaseous  fission  products  are  contained  by  a 
gas-tight  metallic  (steel)  clad. 

The  second  barrier  is  the  gas-tight  primary  coolant  boundary,  which  also  retains, 
along  with  the  radio-active  primary  sodium  and  cover  gas,  the  radioactive  material  that 
may  escape  from  failed  fuel  elements.  This  barrier  is  constituted  by  a  double  walled 
reactor  vessel,  bolted  down  top  plugs,  etc.  Monitoring  of  coolant  and  cover  gas  activity 
enables  shutdown  of  the  plant  and  clean-up  action  to  be  taken  before  the  radioactivity 
in  the  primary  circuit  reaches  unacceptable  levels. 

The  reactor  containment  building,  usually  of  reinforced  concrete  and  steel,  provides 
the  third  barrier  to  the  escape  of  radioactivity.  Different  types  of  containment  buildings 
have  been  considered  for  fast  reactors  (Graham  1971;  Seeman  and  Armstrong  1978).  A 
single  containment  building  with  a  low  leak  rate  to  the  outside  has  been  used  in  small 
fast  test  reactors  like  ebr-II,  rapsodie,  fftf,  joyo,  fbtr,  etc.  Double  containment  has 
been  used  for  the  older  reactors  effbr  (60  MWe),  sefor  as  well  as  for  recent  large 
reactors  like  superphenix-1  (1200  MWe)  and  snr-300.  These  have  within  the  contain¬ 
ment  building,  a  steel  containment  dome  with  inert  gas  around  the  reactor.  The  250 
MWe  prototype  fast  reactors  pfr  and  phenix  had  been  provided  with  leak  resistant 
reactor  buildings  without  regular  containment  building  features. 

The  design  of  these  successive  radioactive  barriers  is  such  that  the  radioactive  release 
under  normal  operating  conditions  is  acceptably  low  as  per  the  radiological  guidelines 
for  the  chosen  site  and  the  radioactivity  is  adequately  contained  under  accident 
conditions. 

In  the  case  of  fbtr,  the  rotating  plugs  at  the  top  of  the  reactor  vessel  are  fitted  with  a 
plug  restraint  mechanism  with  hold-down  bolts.  These  have  been  designed  to  withstand 
and  limit  the  radioactivity  release  into  the  reactor  containment  building  under  certain 
design  basis  accident  scenarios.  The  strength  of  the  containment  building  and  its 
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leakage  rate  are  designed  such  that  this  radioactivity  release  accompanied  by  sodium 
fires  is  easily  contained  with  no  hazard  to  the  public.  Similarly,  in  pfbr,  the  double- 
walled  reactor  vessels  enclosed  in  concrete  vaults  with  top  closures  are  to  be  designed  to 
withstand  design  basis  accidents  without  release  of  radioactivity. 

5.2  Energy  release  calculations 

In  order  to  ensure  the  adequacy  of  the  engineered  safety  features  and  the  containment 
of  radioactivity  under  all  possible  circumstances  by  the  reactor  vessel  and  containment 
building,  it  is  usual  to  study  extremely  unlikely  but  assumed  credible  accident  sequences 
and  then  estimate  the  energy  release  and  other  consequences.  These  hypothetical 
accidents  assume  complete  failure  of  the  plant  protection  system  and  are  called  design 
basis  accidents.  Their  precise  features  vary  from  reactor  type  to  reactor  type.  In  fast 
reactors,  the  design  basis  accident  is  taken  as  fuel  compaction  resulting  from  meltdown 
of  fuel  leading  to  what  is  called  a  Hypothetical  Core  Disassembly  Accident  (hcda) 
whose  mechanical  work  has  to  be  assessed  to  adequately  design  the  engineered 
safeguards. 

The  classical  model  for  the  energy  release  in  an  hcda  is  that  due  to  Bethe  and  Tait 
(1956)  which  has  been  improved  upon  by  later  workers  (e.g.  Nicholson  1964).  The 
model  assumes  an  initial  large  reactivity  insertion  rate  from  fuel  slumping  which  then 
leads  to  a  power  excursion  and  energy  build-up  in  a  molten  core  which  is  limited  but  not 
terminated  by  the  Doppler  coefficient.  Pressures  then  build  up  in  the  fluid  core  which 
expands  and  terminates  the  excursion. 

Important  information  required  is  the  equation  of  state  of  core  materials  upto  high 
temperatures  («  7000°C)  which  are  not  always  accurately  available.  The  rate  of 
reactivity  addition  is  a  second  important  parameter  on  which  very  much  depends  the 
value  of  the  final  energy  release.  This  rate  depends  on  the  kind  of  accident  initiation. 
The  Doppler  coefficient  is  a  third  important  parameter  governing  the  energy  release. 
Calculations  indicate  (Wilson  1977)  that  a  Doppler  coefficient  with  —Tdk/dT  greater 
than  0  002  is  sufficient  to  effectively  reduce  the  energy  build-up  by  a  factor  100.  Lastly, 
the  initial  power  and  temperature  as  well  as  the  amount  of  void  space  that  the  fuel 
vapour  has  to  fill  before  the  disassembly  can  start,  are  important  initial  conditions 
governing  the  final  energy,  release. 

Computer  codes  like  venus-II  (Jackson  and  Nicholson  1972)  calculate  the  heat 
energy  developed  accurately,  subject  to  the  uncertainties  in  the  initial  conditions. 
However,  calculation  of  the  conversion  of  the  heat  energy  to  mechanical  energy  or  more 
precisely  the  forces  and  pressures  which  can  rupture  the  containment  barriers  is  more 
problematic.  Research  on  fuel  coolant  interaction  (fci)  and  shock  structure  interactions 
are  of  relevance  here.  Generally,  isen  tropic  expansion  of  the  vapour  of  all  core  materials 
is  taken  as  the  mechanism  of  conversion  to  mechanical  work,  and  recent  calculations 
with  the  comprehensive  computer  code  simmer-II  (Smith  1978)  show  that  this 
approach  is  safe  and  conservative. 

As  noted  earlier,  the  energy  release  in  an  hcda  is  dependent  on  the  initial  conditions 
which  in  turn  depend  on  the  details  of  the  accident  initiation  and  the  accident  sequence. 
Hence,  a  detailed  mechanistic  modelling  of  the  accident  sequence  is  necessary  for 
realistic  estimates  of  the  energy  release.  This  has  given  rise  to  a  vast  amount  of  research 
in  this  field. 

For  a  mechanistic  analysis  of  design  basis  accidents,  various  possible  accident 
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initiators  have  been  considered  (Graham  1971)  and  the  accident  sequences  usually 
studied  are  the  unprotected  Loss  of  Flow  Accident  (lofa)  and  the  unprotected 
Transient  Over  Power  Accident  (topa)  resulting  from  a  postulated  failure  of  the  pps 
alongwith  either  sodium  pumps  failure  or  accidental  control  rod  withdrawal.  Large, 
sophisticated  computer  codes  like  SAS4A  (Jackson  and  Stevenson  1981)  have  been 
created  to  trace  these  accident  sequences.  The  in-pile  safety  related  experimentation  in 
the  treat  and  the  cabri  facilities  have  provided  very  important  data  for  the  successful 
modelling  of  the  design  basis  accidents  (see  for  example,  Spencer  et  al  1976;  Dadillon 
et  al  1982). 

It  is  important  to  note  here  that  in  the  last  few  years,  there  is  a  re-evaluation  of  the 
role  of  hcda  from  the  probabilistic  viewpoint.  The  hcda  is  found  to  have  no  initiator  in 
a  modern  lmfbr  (Fauske  1981)  and  should  not  influence  the  plant  design.  Thus,  detailed 
mechanistic  considerations  beyond  early  termination  is  to  be  considered  redundant. 

In  the  case  of  fbtr  as  well  as  pfbr,  design  basis  accidents  have  been  studied  in  detail 
for  various  fuel  types  and  core  configurations  using  computer  codes  predis  (Singh  et  al 
1981)  and  venus-II.  The  results  of  these  analyses  have  provided  the  basis  for  the 
containment  design. 


6.  Conclusion 

A  review  has  been  made  of  the  principles  of  fast  reactor  safety  and  the  different  factors 
contributing  to  the  overall  safety  of  a  modern  lmfbr  have  been  highlighted.  The 
important  conclusion  is  that  the  design,  construction  and  operation  of  an  inherently 
safe,  well  protected  lmfbr  pose  no  insurmountable  problems,  and  much  of  the 
uncertainties  surrounding  fast  reactor  safety  issues  have  been  resolved  by  con¬ 
temporary  research. 
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Engineering  development  and  safety  back-up  for 
nuclear  power  programme 

P  R  DASTIDAR  and  A  KAKODKAR 

Reactor  Group,  Bhabha  Atomic  Research  Centre,  Bombay  400085,  India 

Abstract.  This  article  reviews  the  engineering  development  and  safety  aspects  that  are 
relevant  to  the  nuclear  power  programme  being  pursued  in  the  country.  Some  of  the  important 
aspects  have  been  discussed  in  detail  bringing  out  the  current  status  and  also  the  directions  for 
further  work. 

Keywords.  Nuclear  power  programme;  engineering  development;  safety  back-up;  pre¬ 
ssurised  heavy  water  reactor;  emergency  core  cooling  system;  thermosyphon;  coolant  tubes 

PACS  No.  28  45;  28  40 


1.  Introduction 

It  is  now  well  known  that  nuclear  power  is  the  only  source  of  energy  for  bulk  power 
generation  which  can  meet  our  future  requirements.  Our  resources  of  fossil  fuel  which 
form  the  mainstay  for  bulk  electricity  production  at  present  would  tend  to  get  depleted 
with  time  and  it  is  necessary  to  develop  the  alternate  nuclear  source  to  a  level  where  it 
can  take  over  from  the  fossil  fuel  sources  while  maintaining  the  growth  of  electricity 
generation  necessary  to  sustain  the  national  development.  The  three-stage  nuclear 
power  programme  envisaged  by  the  Department  of  Atomic  Energy  is  aimed  at 
achieving  this  objective.  As  of  now  the  emphasis  has  been  mainly  towards  mastering  of 
the  nuclear  technology  and  to  demonstrate  that  nuclear  power  stations  can  be  built 
within  the  country  with  nuclear  electricity  available  at  a  cost  which  is  competitive  with 
the  conventional  electricity.  Time  has  now  come  when  this  effort  needs  to  be  accelerated 
to  enhance  the  installed  nuclear  capacity.  The  nuclear  power  programme  of 
10,000  MWe  by  the  year  2,000  outlined  by  Dr  Ramanna  is  our  immediate  objective.  In 
this  phase,  not  only  one  would  increase  contribution  of  nuclear  electricity  but  also 
enable  production  of  sizable  quantities  of  plutonium  which,  with  the  available  uranium 
resources,  can  sustain  a  much  larger  fast  breeder  programme.  It  is  envisaged  that 
through  the  use  of  fast  breeder  reactors  an  installed  capacity  to  the  tune  of 
350,000  MWe  can  be  realised  with  available  uranium  resources.  Subsequent  to  this, 
further  generation  of  nuclear  electricity  would  be  sustained  by  large  reserves  of  thorium 
resources. 

Realisation  of  the  objective  of  10,000  MWe  capacity  by  the  year  2,000  is  envisaged  to 
be  achieved  by  installing  a  number  of  235  MWe  and  500  MWe  power  units  of  phwr 
type.  In  order  to  realise  installation  of  such  a  large  number  of  units  in  a  small  time, 
many  aspects  such  as  availability  of  finance,  manpower,  industrial  capacity,  organis¬ 
ational  setup,  safety  regulation  etc  have  to  be  looked  into  in  detail.  The  purpose  of  this 
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article  is  to  review  the  engineering  development  and  safety  back-up  that  would  be 
necessary  to  support  the  proposed  power  programme. 


2.  The  phwr  system 

Our  power  programme  is  based  on  pressurised  heavy  water  type  reactors.  We  are  now 
at  a  stage  when  for  the  235  MWe  units  we  have  gone  through  a  full  cycle  of  design, 
construction,  commissioning  and  operation.  Although  the  experience  with  operation  of 
raps-i  unit,  which  has  been  the  first  in  the  phwr  series  of  reactors,  did  reveal  some 
inadequacies,  the  performance  of  raps-2  and  maps-1  has  been  highly  encouraging.  Even 
for  raps-1  the  lessons  have  been  learnt  and  it  should  be  possible  to  bring  this  reactor 
back  on  stream  quite  soon.  It  is  worth  mentioning  at  this  stage  (Strachan  and  Brown 
1984)  that  similar  stations  in  Canada  have  shown  excellent  performance.  As  a  matter  of 
fact,  in  terms  of  lifetime  capacity  factors  upto  end  of  1981,  seven  out  of  8  candu  units, 
find  a  place  in  the  first  ten  positions  of  the  world’s  most  reliable  operating  plants  of 
capacity  greater  than  500  MWe.  The  eighth  one  has  18th  position.  The  design  of  reactor 
units  beyond  Narora  considers  new  features  that  have  been  incorporated  in  Canadian 
design  as  well  as  our  own  experiences.  There  is  thus  no  doubt  that  the  phwr  units  that 
are  being  added  would  show  even  better  performance.  There  is  however,  always  a  need 
to  look  for  further  improvements  on  a  continuing  basis.  For  the  500  MWe  units  which 
are  now  under  design  this  need  is  even  greater. 

For  realisation  of  improved  performance  as  well  as  safety  characteristics  of  the 
reactor  units  it  is  necessary  to  evolve  new  systems  and  generate  adequate  data  to 
understand  and  predict  their  behaviour  under  a  variety  of  conditions  that  could  be 
expected  during  service.  We  would  be  reviewing  some  of  these  areas  in  this  article. 

Figure  1  shows  a  schematic  of  pressurised  heavy  water  reactor  power  plant.  Reactor 
core  is  housed  in  a  cylindrical  calandria  which  contains  heavy  water  moderator  and 
reflector  at  low  pressure.  A  large  number  of  (306  in  235  MWe  units  and  392  in 
500  MWe  units)  fuel  channels  pass  through  the  calandria.  These  channels  contain  fuel 
and  also  flowing  coolant  at  high  pressure  thus  permitting  heat  removal  from  the  core. 
The  primary  coolant  circulating  pumps  circulate  the  coolant  through  the  core  and  the 
steam  generators.  The  heat  extracted  from  the  core  by  the  coolant  is  transferred  in  the 
steam  generators  to  raise  steam  which  drives  the  turbo  generator,  thus  producing 
electricity.  A  number  of  other  important  systems  also  have  to  function.  A  moderator 


Figure  1.  Pressurised  heavy  water  reactor  power  plant. 
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heat  removal  system,  on-power  fuel  handling  system  and  reactor  regulating  system  are 
some  of  the  important  systems.  In  addition,  a  number  of  safety  systems  are  provided  to 
protect  the  whole  system,  plant  personnel  and  members  of  public.  A  view  showing  more 
details  of  the  reactor  is  shown  in  figure  2. 

The  phwr  system  has  been  an  ideal  choice  for  our  power  programme.  It  is  based  on 
natural  uranium  and  heavy  water.  Technology  for  both  has  now  been  developed  and 
demonstrated  on  industrial  scale.  Manufacturing  technology  for  various  equipment 
has  also  been  successfully  mastered.  The  system  has  a  number  of  safety  features  not 
available  in  other  reactor  types.  These  are:  (i)  The  actuators  for  reactor  protection  act 
in  low  pressure  environment  unlike  the  popular  pwr  or  bwr  reactors.  Thus  phwrs  are 
immune  to  accidents  that  can  be  caused  by  ejection  of  neutron  absorber  from  the  core, 
(ii)  The  excess  reactivity  of  the  core  is  not  required  to  be  high  since  on-power  refuelling 
is  available  to  compensate  for  the  fuel  burn-up  at  any  time,  (iii)  Cold  moderator  is 
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Figure  2.  Pressurised  heavy  water  reactor. 
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always  available  in  the  core.  In  extreme  situations  this  heat  sink  comes  into  play  in 
removing  heat  from  the  fuel,  thus  limiting  maximum  temperature  to  a  lower  value, 
(iv)  High  pressure  reactor  pressure  vessel  is  eliminated. 

In  contrast  the  channel  type  design  of  phwr  presents  some  safety  considerations  that 
are  different  from  those  of  other  designs.  These  are:  (i)  provision  of  adequate  cooling  to 
all  channels  under  accident  situations  (ii)  effects  of  the  possibility  of  fuel  coming  in 
contact  with  the  pressure  boundary,  (iii)  introduction  of  positive  reactivity  in  an 
unlikely  loss  of  coolant  accident  event. 

The  safety  criteria  to  be  used  for  assessment  of  the  system  therefore  have  to  consider 
the  above  characteristics  which  are  distinct  and  specific  to  phwr.  Need  for  additions  or 
improvements  has  also  to  be  judged  based  on  these  characteristics. 


3.  Evolution  of  safety  approach 

The  safety  approach  has  to  encompass  a  number  of  situations.  During  normal 
operation  the  permissible  maximum  individual  dose  limits  as  well  as  population  dose 
limits  have  to  be  satisfied.  In  this  category  one  would  also  include  all  off  normal 
situations  of  rather  frequent  occurrence.  Next  we  have  to  take  care  of  various  accident 
situations.  Clearly  both  failure  probability  and  consequences  have  to  be  considered. 
The  known  difficulties  with  regard  to  certain  accident  situations  like  stagnation  breaks, 
anticipated  transients  without  scram  and  common  cause  effects  of  earthquake  have  led 
to  the  philosophy  of  two  group  concepts  with  each  function  being  provided  for  by 
independent  systems  from  each  group  (see  figure  3).  With  this  philosophy  it  is  possible 
to  categorise  accident  situations  in  two  categories  (Atchison  et  al  1983).  One  category 
consisting  of  what  are  called  serious  process  failures  (single  failure)  and  the  other 
consisting  of  events  which  are  dual  failures  i.e.  failure  of  the  process  equipment  along 
with  failure  of  a  safety  system.  In  the  later  case  accident  consequence  is  mitigated  by  the 
second  safety  system. 

The  next  step  is  to  assign  maximum  frequencies  and  the  reference  dose  limits  for  each 
of  the  above  two  categories  and  to  ensure  that  possibility  of  triple  failure  i.e.  failure  of 
process  as  well  as  safety  systems  in  both  groups  is  made  acceptably  small  i.e.  around 
10~7/yr. 

Talking  in  engineering  terms,  the  following  guidelines  evolve,  (i)  The  design, 
construction  and  operation  of  all  components,  systems  and  structures  essential  to 
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Process  system  -  Reactor  regulation 
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-  ECCS 
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Figure  3.  Two-group  concept  for  safety  system. 
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safety  of  the  reactor  will  follow  the  best  applicable  codes,  standards  or  practices  and 
confirmed  by  an  independent  audit,  (ii)  The  quality  and  nature  of  the  essential  process 
equipment  will  be  such  that  the  total  of  all  serious  process  failures  should  not  exceed  a 
specified  number.  A  serious  process  failure  in  this  context  is  a  failure  of  process  system 
or  equipment  that,  in  the  absence  of  safety  system  action,  could  lead  to  fuel  failure  or 
the  release  of  radioactive  material  to  the  environment,  (iii)  The  safety  systems 
belonging  to  two  groups  will  be  physically  and  functionally  separate  from  the  process 
system  and  from  each  other  and  (iv)  each  safety  system  will  be  readily  testable  and  will 
be  tested  at  a  frequency  that  demonstrates  its  unavailability  to  be  less  than  a  specified 
number. 

Consideration  of  various  dual  failures  also  leads  to  definition  of  performance 
requirements  for  safety  systems.  For  example,  a  loss  of  coolant  accident  (loca)  plus 
failure  of  emergency  core  cooling  system  will  lead  to  release  of  fission  products  from  the 
fuel  (the  “source  term”)  that  must  be  accommodated  by  the  containment  system. 
Similarly,  a  loca  with  impaired  containment  system  sets  the  effectiveness  required  of 
the  eccs.  Some  of  the  concerns  which  are  not  satisfactorily  answered  following  the 
single/dual  failure  approach  are: 

(a)  inability  to  consider  large  variation  in  frequency  and  consequences  of  different 
single/dual  failures.  To  overcome  this  difficulty  the  event  frequency  vs  permissible  dose 
limit  acceptance  curve  is  divided  into  a  larger  number  of  steps; 

(b)  problems  relating  to  safety  system  support  features  such  as  electrical,  air  or  water 
supplies  whose  failure  could  result  in  failure  of  both  process  as  well  as  safety  system; 

(c)  need  to  ensure  continued  operation  of  safety  system  after  an  accident  and 

(d)  need  to  include  consideration  of  accidents  like  earthquake  which  could  damage 
both  process  and  safety  system. 

A  more  comprehensive  approach  to  safety  evaluation  is  thus  needed.  This  leads  to  the 
next  stage  of  evolution — which  is  based  on  use  of  safety  design  matrix  (sdm).  It  is 
essentially  a  systematic  record  of  visual  inspection  by  the  analyst  of  all  selected  events  of 
potential  safety  concern/  Extensive  use  of  fault  tree  analysis  and  event  sequence 
diagram  enables  identification  of  possible  causes  for  the  event  and  also  the  various 
consequences,  sdm  enables  better  understanding  of  the  system  behaviour  and  has  the 
potential  to  identify  desirable  design  modifications.  This  approach  can  be  extended  to 
predict  the  risk  posed  by  any  postulated  event  sequence. 

Progress  is  being  made  towards  application  of  probabilistic  risk  assessment  (pra) 
techniques.  With  development  of  this  technique  and  also  appropriate  data  base  it 
appears  that  pra  would  have  a  strong  influence  on  licensing  decisions  in  future. 


4.  Engineering  development 

Engineering  development  is  a  process  of  continuous  evolution.  The  availability  of  new 
technologies,  evolution  of  new  safety  criteria,  experience  with  design,  construction, 
commissioning  and  operation  of  power  reactors,  availability  of  new  information  with 
regard  to  design  techniques,  material  behaviour,  equipment  reliability,  understanding 
of  accident  behaviour  etc  all  have  an  impact  on  the  new  engineering  development  effort. 
As  mentioned  earlier  the  phwr  system  is  expected  to  perform  well  both  in  terms  of 
safety  and  in  terms  of  its  performance  as  a  power  unit.  However,  there  is  always  a  scope 
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to  improve  the  system  through  new  development.  Let  us  now  review  some  of  the 
important  areas  in  greater  detail. 


5.  Emergency  core  cooling  system 

From  the  point  of  view  of  performance  of  emergency  core  cooling  system  the  distinctive 
features  of  phwr  cooling  system  in  contrast  to  the  more  popular  pwr  systems  should  be 
recognised.  There  are:  (i)  The  length  of  large  diameter  piping  is  quite  small  in  phwrs.  In 
contrast  the  small  diameter  piping  is  far  more  extensive  in  phwr.  The  phwrs  have  thus  a 
larger  probability  for  small  break  loca.  (ii)  The  coolant  tubes  which  contain  and  also 
support  the  fuel  are  horizontal.  They  can  deform  into  different  modes  depending  on  the 
pressure  temperature  history,  (iii)  The  primary  heat  transport  system  is  arranged  in  a 
figure  of  eight  configuration  (figure  4).  Certain  combinations  of  size  and  location  of 
break  can  cause  stagnation  conditions  in  the  core,  (iv)  Voidage  of  core  introduces 
positive  reactivity. 

It  is  thus  clear  that  the  system  behaviour  in  case  of  loca  in  a  phwr  is  likely  to  be  quite 
different  from  the  pressure  vessel  type  reactors  for  which  most  of  the  published 
literature  is  relevant.  Thus  while  the  design  of  eccs  goes  on  based  on  the  best  available 
calculational  procedures,  an  extensive  experimental  programme  to  study  various  effects 
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Figure  4.  Figure  of  eight  configuration. 
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that  would  be  manifested  during  various  stages  following  loca  is  also  being  pursued. 
This  work  can  be  divided  into  two  broad  areas.  One  relating  to  the  blowdown  phase 
and  the  second  relating  to  emergency  core  cooling  phase. 

In  the  blowdown  phase  the  main  feature  that  distinguishes  the  blowdown  in  phwr  is 
the  fact  that  the  fuel  channels  in  phwr  are  horizontal.  There  is  thus  need  to  understand 
through  experiments  the  various  phenomena  that  are  involved  in  blowdown  from 
horizontal  channels.  Apart  from  rigging  up  the  experimental  set-up,  this  also  involves 
development  of  techniques  for  measurement  of  blowdown  rates  with  a  variety  of  break 
sizes.  In  parallel  to  the  experimental  programme,  analytical  studies  aimed  at 
development  of  a  computer  program  to  calculate  transient  two-phase  flow  and  heat 
transfer  during  blowdown  from  horizontal  pipes  starting  with  subcooled  condition  and 
with  heat  addition  need  to  be  pursued  (Venkat  Raj  et  al  1983). 

In  case  of  phwrs  this  analysis  needs  to  be  extended  to  analyse  the  effects  of  failure  of 
eccs  (Hancox  1981).  In  such  an  event  the  build-up  of  temperature  would  lead  to 
deformation  of  coolant  tube  which  could  sag  into  contact  with  the  calandria  tube  if  the 
coolant  pressure  is  already  low.  Alternatively  if  the  pressure  has  not  sufficiently 
dropped  down,  the  coolant  tube  can  balloon  into  contact  with  the  calandria  tube.  The 
two  possibilities  are  pictorially  depicted  in  figure  5. 

Once  the  contact  with  the  calandria  tube  is  established  heat  can  get  dissipated  to 
moderator  and  the  maximum  fuel  temperatures  get  decided  by  the  resistances  in  this 
heat  transfer  path. 

In  the  eccs  phase,  the  specific  areas  of  concern  to  phwrs  are:  (i)  problems  caused  by 
the  injection  of  cold  emergency  coolant  in  voided  header  such  as  condensa  tion  induced 
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Figure  5.  Schematic  diagram  of  the  stages  of  fuel  channel  heat  up  and 
deformation. 
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flow  oscillations;  (ii)  flow  distribution  between  parallel  channels  under  eccs  con¬ 
ditions.  (iii)  Rewetting  of  hot  horizontal  channels. 

Here  again  extensive  experimental  data  are  required  to  enable  analysis  of  conditions 
in  the  core  during  loca.  Depending  upon  the  temperature  of  the  fuel  in  various 
channels,  it  is  possible  that  a  larger  share  of  the  flow  injected  in  the  header  may  get 
preferentially  diverted  to  the  cooler  channels  contrary  to  the  actual  requirements. 
Model  studies  are  therefore  required  to  investigate  this  phenomenon  and  to  ensure  that 
all  channels  receive  at  least  the  required  quantity  of  coolant  flow. 

Considerable  work  has  been  done  to  understand  the  rewetting  phenomenon  in 
horizontal  rectangular  as  well  as  annular  ducts  (Venkat  Raj  1984).  A  typical 
experimentally  determined  rewetting  transient  starting  with  an  initial  temperature  of 
500°C  is  shown  in  figure  6.  Comparison  of  traces  cb,  cm  and  ct  indicates  an  inclined 
rewetting  front.  Significant  stratification,  flow  chugging  and  oscillations  in  pressure 
drop  have  also  been  observed.  A  comparison  of  experimental  data  on  re  wetting  velocity 
with  theoretical  prediction  for  an  annular  horizontal  duct  is  given  in  figure  7.  The  data 
collected  during  such  tests  can  be  used  to  work  out  a  more  realistic  prediction  of  the 
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Figure  6.  Rewetting  transients  (initial  temperature  —  500°C). 


/ 


Figure  7.  Comparison  of  predicted  rewetting  velocity  for  Ants-2  with 
experimental  data. 
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integrated  system  behaviour  following  a  loca  and  the  performance  of  the  eccs.  Such  a 
behaviour  can  also  be  studied  on  specially  designed  experimental  set-ups. 


6.  Thermosyphon 

The  current  designs  of  phwrs  depend  on  the  thermosyphon  for  maintaining  the  circulation 
through  the  core  under  conditions  when  the  main  circulating  pumps  are  not  running 
and  the  system  pressure  and  temperature  do  not  permit  bringing  in  the  standby  heat 
removal  system  (pumps  and  heat  exchangers).  While  for  the  new  reactors  this  situation 
can  be  avoided  by  incorporating  a  standby  heat  removal  system  capable  of  working  at 
full  pressure  and  temperature,  a  study  of  thermosyphon  phenomenon  would  be  useful 
both  for  existing  reactor  systems  as  well  as  future  reactor  systems.  Figure  8  shows  a  plot 
of  flow  (in  terms  of  temperature  difference)  vs  time  determined  on  an  experimental  set¬ 
up  simulating  the  phwr  primary  system  layout.  This  plot  indicates  both  initial 
establishment  of  flow  as  well  as  stability  of  flow  under  single  phase  flow  conditions. 

Experimentally  determined  data  for  channel  pressure  drop  over  the  full  range  of 
Reynolds  numbers  are  shown  in  figure  9.  Similar  data  need  to  be  generated  for  the 
coolant  channel  of  500  MWe  phwr.  Such  data  are  essential  for  evaluation  of  safety  in  a 
number  of  situations  when  flow  is  less  than  normal  flow. 

Study  of  thermosyphon  in  the  phwr  system  under  two  phase  flow  conditions  also 
needs  to  be  pursued. 


O  80  160  240  320  400 


Time  (min) 

Figure  8.  Stability  test  at  different  power  levels. 


7.  Zr-2|-  %  Nb  coolant  tubes 

Our  phwrs  so  far  use  zircaloy-2  as  the  material  for  the  coolant  tubes.  This  material  was 
also  used  by  Canadians  in  their  earlier  reactors.  With  the  development  of  Zr-2^  %  Nb 
material  which  has  better  strength  and  creep  resistance  Canadians  switched  over  to  this 
new  material  starting  from  pickering-3  unit.  The  primary  objectives  have  been  to 
extend  the  creep  life  of  coolant  channels  and  effect  greater  neutron  economy. 

A  number  of  problems  have  been  faced  by  Canadians  in  the  use  of  Zr-Nb  tubes.  Most 
of  them  have  been  caused  by  delayed  hydride  cracking  caused  by  faulty  rolling 
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Figure  9.  Fuel  channel  pressure  drop  data. 
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procedure.  Although  all  such  tubes  have  been  subsequently  stress-relieved,  it  is 
anticipated  that  some  initial  cracking  might  have  been  caused  during  the  operating 
period  between  faulty  rolling  leading  to  high  residual  stress  and  the  actual  stress 
relieving  operation.  The  cracks  initiated  during  this  period  would  progressively  grow 
and  tube  failures  on  this  account  are  expected  even  in  future  for  initial  reactors  tubed 
with  Zr-Nb  tubes.  The  mechanism  of  delayed  hydrogen  cracking  has  now  been 
understood  well  and  the  new  rolling  procedures  have  been  developed  which  minimise 
the  residual  stress  thereby  eliminating  delayed  hydride  cracking  and  also  making  the 
stress  relief  operation  unnecessary. 

A  new  mechanism  has  come  to  light  consequent  to  failure  of  zircaloy-2  pressure  tube 
in  Pickering-2.  Displacement  of  garter  spring  support  between  the  coolant  tube  and  the 
calandria  tube  led  to  sagging  of  coolant  tube  in  contact  with  the  calandria  tube.  The 
cold  spot  so  developed  on  the  outside  of  coolant  tube  developed  blisters  of  hydride 
almost  along  the  line  of  contact.  A  fairly  long  partial  thickness  crack  got  generated 
through  cracks  in  these  blisters.  When  the  crack  exceeded  the  critical  dimension  there 
was  a  sudden  unstable  propagation  of  crack  leading  to  gross  failure  of  the  coolant  tube. 
This  event  has  led  to  violation  of  the  belief  held  till  then  that  any  failure  in  the  coolant 
tube  would  be  of  leak  before  break  type.  On  the  positive  side  of  this  event  were  the  facts 
that  the  calandria  tube  did  not  rupture  in  this  event  and  also  that  the  reactor  was  shut 
down  in  an  orderly  manner  demonstrating  that  such  events  can  be  safely  handled.  A 
large  amount  of  literature  is  being  published  indicating  that  the  hydrogen  pick-up  rates 
in  Zr.Nb  tubes  are  much  less  as  compared  to  zircaloy-2  tubes. 

In  view  of  the  serious  safety  implications  this  matter  has  been  very  carefully  reviewed. 
A  decision  has  been  taken  to  adopt  Zr-Nb  coolant  tubes  for  future  235  MWe  units  as 
well  as  the  500  MWe  units.  Needless  to  say  that  additional  features  like  increase  in 
number  of  garter  springs,  use  of  gartersprings  immune  to  unwanted  displacement  etc 
have  also  to  be  looked  into  at  the  same  time. 

An  extensive  programme  is  already  underway  to  develop  and  evaluate  Zr.Nb  coolant 
tube  production  at  the  Nuclear  Fuel  Complex,  Hyderabad,  and  to  develop  a  coolant 
channel  based  on  this  new  material.  Clearly  the  most  crucial  activity  in  this  work  is  the 
development  of  rolled  joint  which  would  not  pose  any  delayed  hydride  cracking 
problem. 


8.  Structural  safety 

Failure  of  piping  and  other  important  structures  could  have  significant  safety 
implication.  For  example  loca  is  caused  by  a  sudden  failure  of  piping  or  some  other 
component  in  the  primary  heat  transport  system.  Development  of  an  identifiable  leak 
well  before  sudden  fracture  is  a  preferable  situation  as  it  generates  an  early  warning  to 
enable  corrective  action.  Dependable  in-service-inspection  of  components  to  identify 
development  of  cracks  or  defects  and  their  early  repair  is  even  more  desirable.  Apart 
from  development  of  techniques  for  leak  detection,  non-destructive  examination  and  in 
situ  repair  we  need  to  be  able  to  assess  areas  prone  to  development  of  defects,  assess 
growth  of  defects  and  the  associated  safety  margin.  This  requires  work  to  be  pursued  in 
area  of  stress  analysis  particularly  with  regard  to  residual  stresses,  fracture  mechanics, 
fatigue  and  life  assessment  etc.  Some  examples  of  work  done  in  this  area  are  shown  in 
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figures  10  to  12.  Figures  10  and  11  show  the  stress  transients  during  longitudinal 
welding  of  a  cylinder  and  its  subsequent  annealing.  The  residual  stress  distribution  after 
welding  and  its  reduction  due  to  annealing  operation  can  also  be  seen  in  these  figures. 
Effect  of  existence  of  residual  stress  on  the  component  life  is  depicted  in  figure  1 2  where 
it  can  be  seen  that  with  existence  of  tensile  residual  stress,  the  cyclic  life  gets 
considerably  reduced  (compare  case  1  with  case  2).  Further  with  compressive  residual 


Figure  10.  Stress  transients  during  cylinder  welding. 
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Figure  11.  Stress  transients  during  annealing  of  welded  cylinder. 
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Residual  cyclic  life  N  (  in  thousands) 

Figure  12.  Effect  of  residual  stress  on  cyclic  life. 


stress  the  life  can  be  increased  (compare  case  1  with  case  3).  This  forms  the  basis  of 
development  of  various  techniques  like  induction  heating  for  stress  improvement,  heat 
sink  welding,  mechanical  pipe  lock  etc,  which  are  aimed  at  delaying  the  component 
failure. 

Even  assessment  of  crack  growth  and  the  critical  crack  length  becomes  considerably 
difficult  with  complications  in  geometry.  Considerable  work  has  been  done  in  this  area 
in  view  of  the  assessment  need  with  regard  to  embrittlement  of  end-shields  in  raps-i,  ii 
and  mapp-i,  as  well  as  the  feed  water  nozzles  in  taps. 

Another  aspect  of  structural  safety  pertains  to  the  behaviour  of  structures  to  seismic 
excitations.  Figure  13  shows  a  typical  structural  model  of  Dhruva  pile  block  used  for 
seismic  analysis.  A  particular  deformed  mode  shape  is  shown  in  figure  14.  Although 
sufficient  capability  exists  in  this  area,  work  is  required  to  be  pursued  to  enable 
nonlinear  analysis  and  water  sloshing  studies. 


9.  Secondary  shutdown  system 

The  current  design  of  phwrs  starting  from  Narora  reactor  incorporates  a  fast  acting 
secondary  shutdown  system.  The  system  consists  of  a  number  of  verticle  tubes  running 
through  the  reactor  through  which  a  liquid  poison  is  injected.  The  number  of  positions 
available  for  this  purpose  is  limited  due  to  severe  congestion  of  various  nozzles  on  top 
of  the  calandria.  The  maximum  negative  reactivity  worth  of  such  a  system  is  thus 
restricted. 

An  alternate  system  which  can  inject  the  liquid  poison  in  moderator  is  therefore 
considered  attractive  and  needs  to  be  developed.  With  such  a  system  the  tubes  running 
through  the  core  for  poison  injection  can  be  kept  horizontal  thereby  permitting 
effective  spatial  separation  between  the  two  shutdown  systems  (see  figure  15).  This 
system  is  expected  to  be  much  simple  and  not  so  severely  limited  in  negative  reactivity 
worth. 
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Figure  13.  Model  of  Dhruva  reactor  for  earthquake  analysis. 


10.  Pressuriser 

Pressure  control  in  current  phwrs  is  maintained  by  a  feed  and  bleed  system.  This  system 
can  cope  with  transients  to  a  limited  extent  and  also  requires  steam  side  pressure  to  be 
regulated  depending  on  the  reactor  power  to  minimise  the  swell/shrinkage  on  the 
primary  side.  The  severe  fluctuations  that  are  possible  with  our  grids  could  lead  to 
unacceptable  pressure  fluctuations  causing  reactor  trip.  A  need  has  therefore  been  felt 
to  incorporate  a  pressuriser  which  would  damp  out  the  pressure  fluctuations  and 
maintain  them  within  the  desirable  limits.  It  is  expected  that  incorporation  of  a  pressuriser 
would  lead  to  more  reliable  performance  of  the  reactor  unit. 

We  have  reviewed  various  items  of  engineering  development  which  could  lead  to 
improved  safety  characteristics  of  the  system.  The  coverage  is  by  no  means  complete. 
Further,  there  are  a  large  number  of  areas  of  development  which  are  aimed  at 
improving  the  performance.  Some  of  the  important  ones  are:  (i)  improved  fuel  with 
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Figure  14.  Plot  of  fourteenth  mode  of  model  of  Dhruva  reactor. 


Figure  15.  Secondary  shutdown  system. 
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graphite  lubrication  which  would  minimise  pellet  clad  interaction  thereby  improving 
fuel  performance  particularly  under  transients;  (ii)  fuel  handling  system  which  is  one  of 
the  most  sophisticated  systems  and  key  to  successful  performance  of  the  reactor  plant 
and  (iii)  partial  boiling  in  channels  which  would  enable  larger  power  to  be  extracted 
through  the  same  reactor  unit. 

With  accumulation  of  more  and  more  reactor  years  of  operation  the  need  to  carry 
out  maintenance  as  well  as  some  of  the  planned  activities  like  channel  adjustment, 
channel  replacement,  steam  generator  tube  plugging,  repair  on  piping  system  etc  would 
grow.  Since  these  activities  require  a  lot  of  man-rems  it  is  necessary  to  develop  methods 
which  can  enable  performance  of  these  activities  in  minimum  time  and  with  minimum 
of  man-rem  expenditure. 

In  conclusion  one  can  say  that  the  existing  R  &  D  base  can  match  the  needs  of  the 
identified  power  programme.  The  efforts  should  be  continued  to  enable  further 
improvements.  Some  of  the  thrust  areas  particularly  having  a  bearing  on  safety  have 
been  identified.  These  are  considered  important  for  realisation  of  better  phwr  system. 
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Abstract.  Accelerator  development  in  India  is  reviewed  with  special  emphasis  on  indigenous 
effort.  It  started  with  the  4  MeV  cyclotron  at  the  Saha  Institute  of  Nuclear  Physics,  grew 
substantially  with  the  installation  of  the  Van  de  Graaff  accelerators  at  Trombay  and  really 
came  of  age  with  the  224  cm  Variable  Energy  Cyclotron  at  Calcutta,  which  resulted  in 
considerable  fall-out  of  technology.  Simultaneously  electron  linac  development  has  also  taken 
place.  Thus  the  stage  is  set  for  developing  new  types  of  accelerators  such  as  the  electron  storage 
ring  synchroton,  and  a  proton/heavy  ion  synchrotron. 

Keywords.  Accelerators;  cyclotron;  Van  de  Graaff;  Tandem;  linac;  technology  fall-out. 
PACS  No.  29  15  Br,  29  20  Dt,  29  20  Hm. 


I.  Introduction 

• 

Charged  particle  accelerator  development  in  the  world  started  in  the  late  1930s  with  the 
primary  motivation  of  seeking  fundamental  understanding  about  the  structure  of  the 
atom.  In  the  last  half  a  century  there  has  been  a  phenomenal  growth  all  over  the  world 
in  the  field  of  accelerator  technology  and  a  variety  of  accelerators  have  been  developed 
for  fundamental  research  as  well  as  applications.  It  is,  therefore,  very  important  to 
review  the  development  of  accelerators  and  consider  the  future  prospects  of  accelerator 
technology  in  India. 

Accelerator  development  in  India  started  in  the  1940s,  when  a  project  was 
undertaken  at  the  Saha  Institute  of  Nuclear  Physics,  Calcutta  to  build  a  38"  cyclotron 
modelled  after  the  cyclotron  at  Berkeley  (sinp,  1983).  Except  for  the  magnet  yoke  and 
pole  pieces  which  were  imported,  all  other  components  were  fabricated  indigenously.  In 
1960,  it  delivered  an  internal  proton  beam  current  of  50-70  fiA  at  4  MeV  and  later  an 
external  beam  current  of  01  //A. 

At  the  Tata  Institute  of  Fundamental  Research,  Bombay,  some  work  was  done 
towards  building  a  1  MeV  cyclotron  in  the  1950s,  though  it  was  not  made  fully 
operational.  At  the  same  time,  the  300  keV  open  air  Van  de  Graaff  accelerator  was  built 
and  operated  (George  et  al  1959)  and  development  work  on  a  1  MeV  electron  linear 
accelerator  was  carried  out. 

Simultaneously  400  keV  Cockroft-Walton  accelerators  were  built  at  the  Saha 
Institute  and  the  Bose  Institute,  Calcutta,  a  1  MeV  Cockroft-Walton  accelerator  was 
bought  and  installed  at  the  Tata  Institute  (tifr  1945-70)  and  a  150  keV  neutron 
generator  was  built  at  the  Aligarh  Muslim  University,  Aligarh. 

For  the  first  time  a  large  accelerator  was  available  in  India  when  the  5-5  MeV  Van  de 
Graaff  accelerator  purchased  from  the  High  Voltage  Engineering  Corporation  (hvec, 
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usa)  was  installed  at  the  Bhabha  Atomic  Research  Centre,  Trombay  in  1962  (Divatia 
et  al  1962).  Though  the  accelerator  was  purchased,  the  switching  magnet  and  the  beam 
transport  system  were  built  indigenously.  Some  time  later,  a  2  MeV  Van  de  Graaff 
accelerator,  also  purchased  from  the  hvec,  was  installed  at  the  Indian  Institute  of 
Technology  (iit),  Kanpur,  and  a  2  MeV  electron  Van  de  Graaff  accelerator  was  built 
indigenously  at  the  Indian  Institute  of  Science  (iisc),  Bangalore.  Using  these 
accelerators  stimulated  the  scientists  and  a  need  for  bigger  accelerators  was  felt.  H  J 
Bhabha  convened  a  meeting  of  scientists  on  3  August  1964  at  the  use  to  consider  this 
question.  After  the  deliberations,  he  summed  up  as  follows: 

“I  think  the  result  of  this  meeting  has  been  extremely  instructive  and  very  fruitful.  I 
am  convinced  that  there  is  a  large  field  of  operation  in  nuclear  physics  where  fruitful 
work  can  be  done  with  these  machines.  The  following  general  situation  seems  to  emerge 
in  my  mind.  One  is  that  the  two  machines  the  Tandem  and  the  avf  accelerator  are 
complimentary.  It  may  be  possible  for  an  avf  machine  with  the  help  of  elaborate 
equipment  to  do  what  a  Tandem  does.  If  we  decide  to  go  in  for  a  Tandem,  it  seems  clear 
that  we  should  buy  one.  The  technology  of  an  avf  is  entirely  different  and  this-  is  the 
field,  except  for  the  cyclotron  at  Calcutta,  which  has  been  neglected  in  India.  This  is  the 
field  which  I  think  should  now  be  developed.  We  are  in  a  position  now  to  enter  in  a  new 
field  of  building  of  accelerators.  Therefore,  I  myself  favour  the  idea  of  going  in  for  an 
avf  machine.” 


2.  Variable  energy  cyclotron  at  Calcutta 

The  Atomic  Energy  Commission  decided  in  1968  to  construct  a  224  cm  diameter 
Variable  Energy  Cyclotron  (vec)  at  Calcutta  indigenously  adopting  the  design  of  the  88 
inch  cyclotron  at  the  Lawrence  Berkeley  Laboratory. 

Specifications  of  the  cyclotron  are  given  in  table  1.  An  overall  view  of  the  cyclotron 
and  the  beam  transport  lines  after  construction  is  seen  in  figure  1,  and  a  close  up  of  the 
cyclotron  is  seen  in  figure  2.  Various  stages  of  progress  of  the  vec  have  been  reported  at 
International  Conferences  on  Cyclotrons  and  their  applications  (Divatia  et  al  1984; 
Chatterjee  et  al  1981;  Divatia  1979).  A  brief  description  of  various  components  of  vec 
and  the  technologies  involved  is  presented  here. 


Table  1.  Design  parameters — vec  Calcutta. 


Beam 

Hill  gap 

1905  cm 

Maximum  energy 

130  Q2/A  MeV 

Valley  gap 

29-97  cm 

Internal  beam 

1  mA 

Maximum  hill  field 

211  kGa 

current 

External  beam 

current 

Energy  resolution 

100  nA 

Maximum  valley 
field 

Acceleration  system 

141  kGa 

a)  Unanalysed 

0-5%  (FWHM) 

Dee 

1;  180° 

beam 

b)  Analysed  beam 

0-024%  (FWHM) 

Frequency  range 

5*5-16-5  MHz 

(1  mm  slit) 

Dee  voltage 

70  kV  (Max) 

Magnet 

Pole  diameter 

224  cm 

Extraction  system 

Spiral  sections 

3 

Deflector  voltage 

120  kV  (Max) 
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Figure  1 .  Overall  view  of  the  vec,  Calcutta  and  the  high  intensity  beam  transport  lines  in  the 
vault. 
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Figure  2.  A  close-up  of  the  cyclotron  showing  the  main  magnet  with  the  main  coil  and  the 
first  quadrupole  magnet  near  the  exit  port  on  the  dee  chamber. 
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2.1  Magnet 

The  most  challenging  job  from  the  standpoint  of  indigenous  effort  was  that  of 
constructing  the  260  tonne  magnet.  Since  large-sized  rolled  plates  of  the  requisite 
quality  and  size  were  not  available, was  decided  to  cast  the  necessary  steel  blocks,  and 
machine  them.  The  Heavy  Engineering  Corporation  (hec),  Ranchi  was  the  only  agency 
capable  of  delivering  the  job,  and  they  agreed  to  do  it.  They  were  highly  successful  in 
casting  the  magnet-quality  steel,  with  carbon  content  less  than  0T  %.  Table  2  shows  a 
comparison  of  the  steel  composition  achieved  and  that  specified.  The  machining  of  the 
magnet,  including  the  3  spiral-shaped  pole  tips  of  the  magnet  was  also  carried  out 
successfully  to  the  required  accuracy.  Figure  3  shows  the  assembled  magnet  and  figure  4 
shows  the  accuracy  achieved  in  the  magnet  gap. 


Table  2.  Comparison  of  specified  and  achieved  steel  com¬ 
position  for  vec  magnet. 


Specified 
composition  (%) 
(maximum) 

Achieved 
composition  ( %) 
(typical) 

Carbon 

0-12 

Oil 

Silicon 

0-35 

0-25 

Manganese 

0-50 

0-30 

Phosphorous 

0-04 

0011 

Sulphur 

0-05 

0-023 

Iron 

Balance 

Balance 

Figure  3.  The  assembled  260  tonne  main  magnet  frame  and  the  main  coils  during  assembly 
stage. 


Accelerator  development  in  India 


231 


Figure  4.  Deviation  (<$)  from  the  required  magnet  gap. 


The  main  magnet  coils,  made  of  etp  copper  of  29  cm  square  cross-section  with  a 
1  *9  cm  hole  in  the  centre  for  cooling  water,  providing  5,60,000  ampere  turns,  were  made 
by  the  Bharat  Heavy  Electricals  Limited  (bhel),  Bhopal  to  the  required  accuracy. 
Similarly  the  technology  of  winding  and  epoxy  potting  the  trim  and  valley  coils  was  also 
evolved  at  the  bhel  and  fabrication  completed  to  full  satisfaction. 

The  main  magnet  coils,  1 7  trim  coils  and  5  sets  of  valley  coils,  all  require  independent 
power  supplies  with  stabilized  currents.  The  building  of  these  power  supplies  was 
totally  an  in-house  effort,  undertaken  by  the  personnel  of  the  vec  Centre.  For  some  of 
the  power  supplies  special  water  cooled  transformers  were  built.  Table  3  shows  the 
specifications  of  these  power  supplies  and  figure  5  shows  the  inside  of  a  trim  coil  power 
supply.  The  peak  power  consumption  for  the  main  magnet  alone  is  about  1  MW. 


Table  3.  Power  supplies  for  vec. 


Unit 

No. 

Ratings 

Remarks 

Main  coil 

1 

150  V,  2800  A 

scr  controlled  current  regulated  to  0-01  % 

Trim  coils 

17 

6-24  V,  750-2500  A 

Series  transistor  current  regulated  to  0  01  % 

Valley  coils 

5 

15  V,  300  A 

Unregulated 

Oscillator 

1 

20  kV,  20  A 

Series  tube  fast  crowbar 

Ion  source 

1 

600  ARC  Voltage 

Series  tube  scr  controlled 

500  A  filament  current 

Deflector 

2 

120  kV,  5  mA 

Voltage  multiplier 

Switching  magnet 

2 

50  V,  300  A 

SCR  controlled  0  0 1  % 

Analysing  magnet 

1 

150  V,  500  A 

scR/transistor  nmr  sensing  0  001  °0 

Quadrupole  magnet 

5 

30  V,  300  A 

Regulation  01  % 
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Figure  5.  A  typical  inside  view  of  a  regulated  power  supply. 


The  total  quality  of  the  magnet  can  best  be  judged  by  examining  the  B  vs  I  curve, 
shown  in  figure  6.  It  can  be  seen  that  the  curve  is  in  excellent  agreement  with  a  similar 
curve  for  the  Berkeley  cyclotron  magnet. 

2.2  Radiofrequency  system 

The  radiofrequency  system  of  the  vec  is  a  400  kW  rf  oscillator  system  in  the  frequency 
range  5*5—1 6*5  MHz,  using  panel  movement  for  frequency  variation.  The  resonator 
tank,  made  from  38  mm  thick  copper  clad  steel  plates,  measuring  2-36  m  x  3-13  m 
x  2-39  m,  was  fabricated  at  the  Garden  Reach  Shipbuilders  and  Engineers  Ltd. 
Calcutta  and  the  dee,  dee  stem  and  panels,  made  from  etp  copper,  were  fabricated  at  the 
Central  Workshops,  barc,  Bombay.  The  rf  panels  inside  the  resonator  tank  are  shown 
in  figure  7. 


MAG.  FIELD,  8,  KIIOGAUSS 
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MAIN  COIL  CURRENT,  I ,  AMP 

Figure  6.  B  vs  I  characteristics  of  the  main  magnet;  lower  portion  shows  average  magnetic 
field  B  vs  radius. 


Figure  7.  Radiofrequency  movable  panels  and  the  dee  stem  inside  the  resonator  tank. 
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In  the  earlier  stages,  an  rf  self-excited  oscillator  system  based  around  the  rca  6949 
triode  was  used.  Later  on,  for  reasons  of  economy  and  more  reliable  and  stable 
operation,  an  rf  system  of  mopa  configuration  with  the  rca  4648  tetrode  has  been  used. 
Figure  8  shows  a  block  diagram  of  the  system.  The  high  voltage  anode  power  supply  for 
the  rf  system  was  built  by  the  in-house  vecc  group. 


Figure  8.  Block  diagram  of  the  radiofrequency  system. 


2.3  Vacuum  system 

The  combined  volume  of  the  dee  chamber  and  the  resonator  tank  is  23  m3,  and  when 
the  vacuum  system  for  evacuating  this  volume  to  a  pressure  in  the  range  of  10-6  torr 
was  built,  it  was  the  largest  one  built  indigenously.  There  are  two  89  cm  oil  diffusion 
pumps,  backed  by  roots  and  rotary  pumps,  with  proper  arrangements  of  freon  cooled 
chevron  baffles  and  gate  valves.  The  unbaffled  pumping  speed  of  each  pump  is  42,000 
litres  per  second.  This  pump  under  initial  testing  is  shown  in  figure  9.  The  pumps, 
baffles  and  the  gate  valves  were  made  at  the  Technical  Physics  Division,  barc. 
Subsequently,  this  technology  has  been  transferred  to  the  Indo-Burma  Petroleum  Co. 
and  many  large  vacuum  systems  have  been  built  by  them  for  the  Indian  Space  Research 
Organisation  (isro)  and  other  agencies. 

2.4  Ion  source ,  probes  and  deflector 

The  ion  source  is  a  hot  cathode  pig  type  and  it  is  mounted  on  a  3-6  m  long  shaft  which  is 
inserted  through  a  central  hole  at  the  bottom  of  the  magnet.  There  is  an  air-lock  so  that 
the  ion  source  can  be  pulled  out  for  filament  replacement  without  disturbing  the  main 
vacuum.  A  view  of  the  ion  source  assembly  below  the  magnet  is  shown  in  figure  10. 
There  are  three  water-cooled  copper  probes — the  target  probe,  the  dee  probe  and  the 
deflector  probe,  for  monitoring  the  beam  current  as  a  function  of  the  radius,  as  well  as 
for  measuring  the  radial  and  vertical  characteristics  of  the  beam.  All  these  components 
are  remote  controlled  from  the  control  room. 

The  electrostatic  deflector  has  three  water-cooled  cylindrical  electrodes  and  a 
tungsten  septum.  The  electrodes  can  be  moved  to  vary  the  gap  by  remote  control, 
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Figure  9.  89  cm  oil  diffusion  pump  under  initial  testing. 


through  servo-motors.  A  120  kV,  5  mA  power  supply  provides  the  necessary  electrical 
field  in  the  deflector  gap. 

The  ion  source,  probes  and  the  deflector  were  made  by  the  joint  efforts  of  the  Central 
Workshops,  barc  and  the  vecc  Workshop. 

2.5  Beam  transport  and  data  processing  systems 

The  beam  transport  system  consists  of  three  high  intensity,  low  resolution  experimental 
channels  which  are  operational  and  six  high  resolution,  low  intensity  experimental 
channels,  which  are  under  installation.  There  is  one  switching  magnet  giving  the  three 
high  intensity  channels,  and,  after  an  analysing  magnet,  another  switching  magnet 
giving  the  six  high  resolution  channels.  The  steel  for  the  switching  and  analysing 
magnets  was  cast  at  the  hec,  Ranchi  and  machining  has  been  done  in  various 
engineering  firms.  The  coils  for  the  analysing  magnet  have  been  fabricated  by  the  bhel, 
Bhopal,  whereas  the  coils  for  the  first  switching  magnet  have  been  made  at  the  vecc 
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Figure  10.  A  view  of  the  ion  source  assembly  below  the  magnet. 


Workshop.  Figure  1 1  shows  the  first  switching  magnet  under  installation.  A  large 
number  of  quadrupole  magnets  are  required  for  focussing  the  beam  at  various  stages  of 
the  beam  transport  system.  A  typical  quadrupole  magnet  is  shown  in  figure  12.  The 
design  and  fabrication  of  some  quadrupole  magnets  has  been  done  at  vecc  and  a  large 
number  have  been  fabricated  at  commercial  workshops,  under  vecc  supervision.  Many 
elements  of  the  beam  transport  system,  such  as  collimating  slits,  beam  viewers,  faraday 
cups,  etc  have  been  fabricated  entirely  at  vecc.  Figure  13  shows  a  completed  beam 
transport  line  in  the  experimental  area. 

Facilities  for  research  at  vecc  include  a  915  mm  scattering  chamber  fabricated  at 
vecc,  a  target  facility  for  making  a  variety  of  targets,  a  detector  facility  for  fabricating 
various  surface  barriers  and  Si(Li)  detectors  and  an  electronics  facility  for  producing 
nuclear  instrument  modules. 
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Figure  11.  The  first  switching  magnet  under  installation;  the  main  magnet  is  seen  in  the 
background. 


Figure  12.  A  typical  quadrupole  magnet. 


2.6  Low  conductivity  water  system 

Since  the  many  magnet  coils,  power  supplies  and  the  rf  system,  all  require  extensive 
cooling,  a  1000  gpm,  200  psi,  low  conductivity  (conductivity  1  /zmho-cm)  water  cooling 
system  has  been  installed.  Figure  14  shows  the  storage  tanks  and  the  heat  exchangers  of 
the  cooling  system. 


238 


A  S  Divatia  and  C  Ambasankaran 


'  , 


Figure  13.  A  completed  beam  transport  line  in  the  experimental  area. 


Figure  14.  Low  conductivity  water  system. 


2.7  Control  system 

Control  and  monitoring  of  all  the  cyclotron  systems  is  done  remotely  from  the  control 
console  in  the  control  room,  shielded  by  adequately  thick  concrete.  A  view  of  the 
control  console  is  shown  in  figure  1 5. 
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Figure  15.  A  view  of  the  control  room  with  various  controls  and  instrumentation  panels;  the 
video  monitors  display  the  beam  spot  on  an  alumina  beam  viewer  and  the  cyclotron  central 
region. 


2.8  Cyclotron  operation  and  utilization 

Regular  cyclotron  operation  has  been  possible  only  after  the  availability  of  uninter¬ 
rupted  power  at  vecc  from  September  1981.  The  utilization  programme  has  been 
gradually  stepped  up.  During  1983,  the  cyclotron  was  scheduled  for  operation  for  3626 
hours;  41  %  of  this  time  was  available  for  experiments  and  1 7  %  was  available  for  beam 
development  and  tuning;  the  rest  of  the  time  comprised  on-line  repairs,  machine  start¬ 
up  and  stopping  and  grid  power  failure.  He++  beams  in  the  range  of  30  MeV  to 
75  MeV  were  utilized.  Details  regarding  beams  obtained  and  utilized  are  given  in  table 
4.  During  1984  the  cycle  time  of  cyclotron  operation  has  been  further  increased. 

The  utilization  of  the  cyclotron  has  covered  various  fields,  such  as  nuclear  physics, 
solid  state  physics,  radiation  damage,  chemistry  and  analytical  chemistry,  radiochem¬ 
istry,  isotope  production,  biophysics  and  metallurgy. 

As  many  as  1 7  research  institutions  and  universities  all  over  the  country  have  already 
utilized  the  cyclotron  for  experiments.  They  are  the  Saha  Institute  of  Nuclear  Physics, 
Tata  Institute  of  Fundamental  Research,  Bhabha  Atomic  Research  Centre,  vec  Centre, 
Reactor  Research  Centre,  Indian  Association  for  the  Cultivation  of  Science,  Bangalore 
University,  Mohanlal  Sukhadia  University,  Udaipur,  Mysore  University,  Banaras 
Hindu  University,  Aligarh  Muslim  University,  Calcutta  University,  Kalyani 
University,  Punjab  University,  Burdwan  University,  Jadavpur  University  and  iit, 
Kanpur. 

2.9  Technology  fall-out  from  vecc 

There  has  been  considerable  fall-out  of  technology  from  the  project  of  constructing  the 
Variable  Energy  Cyclotron  indigenously  as  summarised  in  table  5.  Large  industries, 
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Table  4.  Alpha  beam  information. 


Energy  (MeV) 

Beam  current 

(M) 

30 

14 

35 

11 

40 

11 

45 

10 

50 

7-5 

55 

5 

60 

5 

65 

4 

70 

2 

75 

2 

80 

2 

85 

1 

100* 

2 

Energy  resolution  : 

0-5%  (FWHM) 

Emittance,  horizontal  : 

307r  mm-mrad 

vertical  : 

207i  mm-mrad 

Pulse  width  : 

10°  rf  (optimized  at 

40  MeV) 

Status — external  in  all  cases,  except  where  marked 
with  an  *. 

Table  5.  Fall  out  of  technology  from  vecc. 


Casting  of  steel  with  01  %  carbon  content  at  hec,  Ranchi  for  achieving  high  magnetic  fields. 

Epoxy  potting  of  main,  trim  and  valley  coils  (Cu  conductors)  of  cyclotron  at  bhel,  Bhopal. 

Precision  machining  of  main  magnet  at  hec,  Ranchi. 

Fabrication  of  89  cm  dia  diffusion  pumps  with  their  chevron  baffles  for  the  first  time  in  India. 

Fabrication  of  high  voltage,  high  current,  high  frequency  epoxy  cast  transformers  for  use  in  vecc  and  other 
dae  units. 

Fabrication  of  water-cooled  high  current  transformers. 

Fabrication  of  high  current  (3000  A),  stabilised  (1  in  105)  power  supplies  for  cyclotron  magnet. 
Fabrication  of  high  power  (300  kW),  wide  band  (5-20  MHz)  amplifiers. 

Fabrication  of  precision,  high  current  sensing  transformers. 

Fabrication  of  special  magnets  such  as  quadrupole  magnets,  switching  magnets,  analysing  magnet  and 
steering  magnets. 

Development  of  semiconductor  detectors. 

Development  of  nuclear  instrument  modules  for  nuclear  physics  experiments. 


both  public  and  private  sector,  as  well  as  small  industries  have  been  involved  in  the 
process,  and  in-house  technological  capabilities  have  been  increased  while  developing 
new  processes. 

3.  Variable  energy  cyclotron  at  Chandigarh 

The  66  cm  Variable  Energy  Cyclotron  at  Punjab  University,  Chandigarh  is  one  of  the 
earlier  cyclotrons  made,  operating  around  1953-54  at  the  University  of  Rochester, 
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Rochester,  ny,  usa  (Punjab  University  1979).  It  was  transferred  first  to  the  Kurukshetra 
University  and  then  to  the  Punjab  University,  where  it  was  installed  after  modifications 
and  made  operational  in  1973. 

This  is  a  classical  single  dee  cyclotron  with  arrangements  for  frequency  variation 
from  10  to  20  MHz  and  magnetic  field  upto  14  kiioGauss.  Typical  external  beams 
available  on  target  are 


Protons 
Deuterons 
4He+  + 
3He+  + 


2  pA  (max),  1-5  MeV 

1  pA  (max),  1-4  MeV 

lOOnA  (max),  1-2  MeV 

100  nA  (max),  4—1 1  MeV 


4.  2  MV  Tandem  Van  de  Graaff  accelerator 


Operation,  maintenance  and  development  of  the  5-5  MeV  Van  de  Graaff  accelerator  at 
Trombay  has  generated  experience  in  the  technology  of  electrostatic  accelerators. 
Using  this  experience,  a  fully  indigenous  2  MV  Tandem  accelerator  has  been  designed 
and  constructed  at  Trombay,  (Betigin  et  ai  1982;  Singh  et  ai  1982). 

There  are  two  support  columns,  each  consisting  of  49  sections,  each  section  being 
separated  by  9  unglazed  ceramic  insulators  of  25  mm  thickness,  bonded  by  a 
commercially  available  two-component  adhesive.  The  pressure  vessel  contains  a  N2 


Figure  16.  Column  structure  details  and  the  accelerating  tube  of  2  MV  Tandem  Van  de 
Graaff  at  Trombay. 
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and  C02  gas  mixture  at  1 7*6  kg/cm.  A  nylon  endless  belt  is  used  for  carrying  charge. 
The  accelerating  tube  consists  of  index  glass  rings  and  stainless  steel  electrodes.  A 
negative  ion  source,  an  einzel  lens,  a  pre-selector  magnet,  a  quadrupole  magnet  and  an 
analysing  magnet,  in  conjunction  with  the  accelerating  tube,  comprise  the  beam 
handling  system.  Figure  16  shows  the  column  structure  details  and  the  accelerating 
tube. 

The  beams  obtained  are  protons:  1  //A;  16Q2  +  :  200  nA;  and  1603  +  :  40  nA. 


5.  4  MeV  400  Rad  standing  wave  linac  at  Bombay 

A  4  MeV  400  Rad  standing  wave  linac  has  been  designed,  developed  and  fabricated  at 
the  Tata  Institute  of  Fundamental  Research;  it  has  been  installed  at  the  Vikram 
Sarabhai  Space  Centre  (vssc),  Trivandrum  for  x-ray  radiography  and  is  in  operation 
since  August  1983  (Syunry  1983).  A  side  view  of  the  linac  is  given  in  figure  17.  The  x-ray 
head  is  manouverable  in  all  directions  so  that  the  x-ray  beam  can  be  directed  at  any 
part  of  the  heavy  fabricated  items  such  as  large  castings  and  rocket  motors.  The 
operational  parameters  of  the  linac  are  given  in  table  6. 


Figure  17.  A  side  view  of  the  4  MeV  standing  wave  linac  at  tifr. 


6.  8  MeV  microtron  accelerator,  Pune 

An  8  MeV  microtron  has  been  built  and  installed  at  the  University  of  Poona  (Asgekar 
1980).  It  gives  an  internal  electron  beam  current  of  10  //A,  and  an  external  beam  current 
of  1  pA  at  an  extraction  radius  of  40  cm.  The  electron  pulses  are  of  2  /isec  duration,  and 
the  repetition  rate  is  50  sec-1. 


Accelerator  development  in  India 


243 


Table  6.  Operational  parameters  of  the  4  MeV  standing  wave  linac  at  tifr. 


Maximum  energy  of  x-rays 
Dose  rate  (max) 

Focal  spot  size 
Collimator  angle 
Field  size  at  one  metre 
Half  value  thickness 
Size  of  x-ray  head 
Weight  of  x-ray  head 
Beam  centering 
Movement  of  x-ray  head 


250  Rad/min  at  one  metre 

2  mm  dia 

30° 

560  mm  dia 
25  mm  of  steel 
760  x  760  x  1700  mm 
1000  kg 

By  laser  beam  direction  finder 
Telescopic  vertical  upto  5  m; 

±175°  about  vertical  axis; 

+  45°  to  —  95°  about  horizontal  axis; 
Long  and  cross  travel  by  EOT  crane 


4  MeV 


7.  Future  accelerators 

A  2  MV  Tandem  accelerator,  exactly  similar  to  the  2  MV  Tandem  accelerator  at 
Trombay,  is  being  fabricated  at  the  Reactor  Research  Centre,  Kalpakkam,  and  it  is 
expected  to  be  commissioned  in  1985. 

A  14  ud  Pelletron  purchased  from  the  National  Electrostatic  Company  (nec),  usa,  is 
being  installed  at  the  Tata  Institute  of  Fundamental  Research,  Bombay  by  barc  and 
tifr.  It  will  accelerate  protons  to  28  MeV,  alphas  to  42  MeV  and  heavy  ions  such  as 
28Si,  32S,  40A  and  64Ni.  This  accelerator  is  expected  to  be  operational  in  1985.  A 
superconducting  linac  booster  is  planned  as  a  post-accelerator. 

A  major  programme  of  accelerator  development  is  envisaged  at  the  Centre  for 
Advanced  Technology,  barc,  Indore.  Proposals  are  being  studied  for  an  electron 
storage  ring  synchrotron,  a  proton/heavy  ion  synchrotron  (Jain  and  Divatia  1977)  and 
a  medical  cyclotron. 


8.  Conclusion 

A  variety  of  accelerators  have  been  designed,  developed  and  built  in  India.  Wherever 
accelerators  have  been  purchased,  associated  development  work  has  been  necessary. 
Construction  of  the  Variable  Energy  Cyclotron  at  Calcutta  has  given  a  major  boost  to 
the  accelerator  technology  and  established  a  strong  liaison  between  Research  Institutes 
and  the  Industry. 

Besides  the  accelerators  discussed  here,  there  are  u  number  of  accelerators,  with 
energy  less  than  1  MeV,  in  Research  Institutes  and  universities,  which  are  used  as 
Neutron  Sources  or  for  ion  implantation.  Quite  a  few  medical  institutes  and  hospitals 
have  procured  electron  accelerators  for  radiation  therapy.  The  need  for  accelerators 
and  development  of  accelerator  technology  is  growing. 

Accelerator  development  in  India  has  been  pioneered  by  Meghnad  Saha,  given  a 
major  thrust  with  a  vision  by  Homi  J  Bhabha,  encouraged  by  Vikram  A  Sarabhai  and 
brought  to  a  high  technology  level  with  a  promise  for  the  future  by  Raja  Ramanna. 
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Phenomenology  of  the  Pokaran  pne  experiment 

R  CHIDAMBARAM,  S  K  SIKKA  and  SATISH  C  GUPTA 

Physics  Group,  Bhabha  Atomic  Research  Centre,  Trombay,  Bombay  400085,  India 

Abstract.  The  phenomenology  of  the  Pokaran  pne  experiment  (yield  - 12  kiloton  of  tnt) 
conducted  in  a  shale-sandstone  rock,  107  meters  underground,  is  described  with  the  aid  of 
computations  using  a  one-dimensional  spherical  symmetric  rock  mechanics  computer  code 
developed  by  the  authors.  The  calculated  values  of  cavity  radius,  spall  velocity  and  extent  of 
rock  fracturing  are  in  good  agreement  with  the  observed  values.  The  principal  mechanism  for 
crater  formation  at  Pokaran  was  spall  and  the  relatively  smaller  crater  dimensions  and  non¬ 
venting  of  radioactivity  gases  were  due  to  lower  kinetic  energy  transferred  to  the  shale- 
sandstone  rock. 

Keywords.  Pokaran;  peaceful  nuclear  explosions;  crater;  non-venting. 

PACSNo.  91-90;  91-60. 


1.  Introduction 

India’s  first  and  only  peaceful  nuclear  explosion  experiment  was  carried  out  in 
Rajasthan  desert  on  1 8  May  1974  at  a  place  near  Pokaran.  The  object  of  the  experiment 
was  to  study  the  explosion  phenomenology,  fracturing  effects  in  rocks,  ground  motion, 
containment  of  radioactivity  and  postshot  access  of  the  region  around  the  point  of 
detonation  (Chidambaram  and  Ramanna  1975).  Such  studies  are  important  in  the 
context  of  possible  applications  of  peaceful  nuclear  explosions  (pnes). 

A  plutonium  device,  of  yield  12  kt  equivalent  of  tnt,*  was  emplaced  in  a  shale 
medium,  at  a  depth  of  107  m,  in  a  chamber  at  the  end  of  an  L-shaped  hole.  Upon 
detonation,  the  ground  surface  above  the  emplacement  point  rose  with  a  velocity  of 
about  25-30  m/sec  to  form  a  dome  170  m  in  diameter  and  34  m  in  height.  The  mound 
remained  intact  during  its  growth  and  fall  and  no  venting  of  radioactive  gases  was 
observed.  The  resultant  apparent  crater,  measured  with  respect  to  the  preshot  ground 
surface,  has  an  average  radius  of  47  m  and  depth  of  10  m. 

The  phenomenology  of  the  event  is  interesting  as  it  is  the  only  recorded  case  of  an 
explosion  which  produced  a  crater,  although  a  very  shallow  one,  and  yet  was  completely 
contained  from  the  radioactivity  point  of  view.  Also,  a  comparison  of  the  Pokaran 
crater  dimensions  with  other  cratering  experiments  in  hard  rock  (figures  1  and  2)  shows 
that  the  Pokaran  crater  was  smaller.  At  its  scaled  depth  of  burial  (sdob)  51*5  m/kt1/3‘4, 
the  scaled  radius  and  depth  for  a  hard  rock  cratering  experiment  are  predicted  to  be 
(Nordyke  1970)  SRfl  =  27  m/kt 1/3,4  and  SDfl  =  14  m/kt1/3'4,  compared  to  the  values  of 
23m/kt1/3'4  and  5m/kt1/3'4  respectively  observed  for  the  Pokaran  experiment.  The 


*  1  kt  =  4  186  x  109ergs,  the  explosive  energy  equivalent  of  1000  tonnes  of  tnt. 
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Figure  1.  Scaled  apparent  crater  radius  versus  the  scaled  depth  of  burst  for  nuclear  cratering 
explosions  in  hard  rock. 


Figure  2.  Scaled  apparent  crater  depth  versus  the  scaled  depth  of  burst  for  nuclear  cratering 
explosions  in  hard  rock. 


peak  vertical  velocity  of  25  to  30  m/sec  for  the  latter  also  is  lower  than  the  value 
~  40  m/sec,  as  read  from  figure  6  of  Toman  (1970).  In  this  paper,  we  have  tried  to 
explain  these  observations  based  on  computer  calculations  using  ‘ocener’,  a  one¬ 
dimensional  spherical  rock  mechanics  code  developed  in  our  laboratory  (Gupta  et  al 
1979a;  Gupta  1979). 


The  Pokaran  pne  experiment 

2.  One-dimensional  rock  mechanics  code 
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The  computer  code  developed  by  us  has  many  similarities  with  the  American  code  ‘soc 
(Cherry  1967;  Cherry  and  Petersen  1970;  Schatz  1973)  and  the  French  code  ‘S’ 
(Michaud  and  Maury  1970).  Some  of  the  physical-mechanical  processes  associated  with 
the  propagation  of  the  stress  field  set  up  in  a  geological  medium  by  a  sudden  release  of 
the  explosive  energy  of  a  nuclear  device  (Teller  et  al  1968)  are  vaporisation,  melting, 
crushing,  fracture  and  motion  of  the  rock.  Also  the  stress  wave  gets  reflected  at  the  free 
ground  surface  and  this  transfers  additional  kinetic  energy  to  the  rock  medium.  The 
final  crater  dimensions  depend  upon  the  total  kinetic  energy  transferred  to  the  region 
above  the  explosion-produced  cavity.  Our  code  models  these  events  in  one  dimension, 
in  spherical  symmetry,  by  finite  difference  solutions  of  the  laws  of  mass,  momentum 
and  energy  conservation,  and  employs  the  physical  and  mechanical  properties  of  the 
rock  material  under  different  stress  conditions. 

In  a  given  calculation,  the  medium  surrounding  the  explosion  point  is  divided  into 
spherical  meshes  and  the  conservation  laws  are  expressed  in  a  Lagrangian  co-ordinate 
system,  so  that  the  conservation  of  mass  is  automatically  achieved.  From  the  initial 
stress  field  at  time  t,  the  momentum  equation  (equation  of  motion)  is  solved  to  ascertain 
the  acceleration  of  each  zone  boundary.  Accelerations,  when  allowed  to  act  over  a  small 
time  increment  At,  produce  new  velocities  and  subsequently  new  displacements  of  the 
zone  boundaries.  These  displacements  produce  strains  which,  depending  upon  the 
equation  of  state  of  the  rock,  produce  the  new  stress  field.  The  cycle  is  repeated  after 
each  time  increment  At.  The  steep  shock  fronts  are  handled  in  the  program  by  the 
artificial  viscosity  method  due  to  von  Neumann  and  Richtmayer  (1950).  Some  of  the 
outputs  of  this  program  are  the  following: 

4 

(i)  the  amount  of  shock-vapourised  rock, 

(ii)  the  amount  of  shock-melted  rock, 

(iii)  the  radius  of  the  cavity  and  the  pressure  inside  it  at  any  given  time, 

(iv)  profiles  of  shock  position  versus  time,  peak  stress  (both  hydrostatic  and 
deviatoric)  versus  radius,  particle  velocity  versus  radius,  etc., 

(v)  spall  velocities  of  the  mesh  points  as  the  reflected  tensile  wave  from  the  surface 
travels  towards  the  growing  cavity, 

(vi)  kinetic  energy  transferred  to  the  medium,  and 

(vii)  the  extent  and  type  of  rock  failure  (ductile  or  fracture). 

More  details  of  the  code  are  given  elsewhere  (Gupta  et  al  1979a). 


3.  Medium  properties* 

The  geology  around  the  emplacement  point  has  been  briefly  described  by 
Chidambaram  and  Ramanna  (1975).  The ‘shot  point  was  located  in  a  sedimentary 


*  The  measurements  of  the  various  medium  properties  reported  in  this  section  were  carried  out  in  several 
divisions  of  barc  and  elsewhere. 
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Ground  surface 

Broken  Sandstone 


Sqndstone 
Po  =2.10gm/c.c. 
Co  =  230  bars 
0o  =  25  7. 

W  s  2  7. 
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Po  =  2.45gm/cc 
Co  =  530  bars 
0o  =  12  7o 
W  s  47. 
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®  Detonation  point 
(107m) 


Figure  3.  Vertical  geologic  section  adopted  for  calculations. 

formation  and  the  area  surrounding  the  surface  ground  zero  was  a  flat  terrain  covered 
with  sand  and  sandy  loam  type  of  soil.  The  section  of  the  geological  structure  adopted 
for  the  calculations  is  shown  in  figure  3.  Some  of  the  measured  physical  and  mechanical 
properties  are  also  shown.  The  chemical  composition  of  the  rocks  suggests  that  the 
major  constituent  of  these  rocks  is  Si02  and  the  rock  can  be  classified  as  ‘sedimentary 
Si02  rock  type’. 

3.1  P-V  curves 


The  experimental  shock  Hugoniot  data  for  Pokaran  shale*  are  plotted  in  figure  4  along 
with  data  for  several  shales  and  sandstones1  found  in  usa.  It  can  be  seen  that  an 
‘average’  curve  for  a  shale  medium  can  be  drawn  independent  of  initial  density  and 
hence  of  porosity.  This  implies  that  the  compressibility  at  higher  pressures  does  not 
vary  much  with  minor  variations  of  the  rock  constituents  and  is  a  function  of  grain 
density  only.  This  appears  to  be  reasonable  since  with  increasing  pressure,  the  pore 
collapse  and  porosity  disappears  irreversibly  at  about  40kbar  (Stephens  et  al  1969). 
Such  averaging  of  the  Hugoniot  data  for  calculations  for  parameters  studies  using 
computer  codes  has  also  been  done  by  Hearst  (1971)  for  alluvium  and  by  Butkovich 
(1973)  for  granite,  basalt  and  tuff  rocks.  It  is  interesting  to  note  that  the  Hugoniot  curve 
calculated  for  Pokaran  shale  (p  =  2-45  g/cm3)  using  the  Butkovich’s  (1973)  prescrip¬ 
tion,  matches  well  with  the  ‘average’  curve.  The  P-V  curve  for  Pokaran  shale,  therefore. 


*  The  sample  for  these  measurements  had  density  of  2-22  g/cm3. 

+  These  data  for  other  rocks  are  taken  from  figure  2  of  Terhune  (1973). 


The  Pokaran  pne  experiment 


249 


Figure  4.  Shock  Hugoniot  for  Pokaran  shale  compared  with  various  sandstones  and  shales. 
Full  curve  is  the  one  calculated  for  p0  =  2-45  g/cm3  using  pmugen. 


was  calculated,  including  the  lower  pressure  range  (below  40kbar),  employing  a 
Trombay  program  pmugen  based  on  Butkovich’s  prescription  (Gupta  1979).  For 
Pokaran  sandstone,  since  the  shock  wave  decays  to  less  than  5  kbar  when  it  reaches 
shale-sandstone  interface,  the  P-V  curve  is  required  only  in  the  low  pressure  regime. 
This  also  is  calculated  using  the  program  pmugen. 

3.2  Strength  curves 

Since  the  facilities  to  carry  out  triaxial  tests  on  rock  samples  were  not  available,  the 
measured  values  of  unconfined  compressive  strength  of  the  rock  have  been  used  to 
estimate  the  strength  curves  employing  the  expression  given  by  Gupta  and  Sikka 
(1978). 

C/C0  =  l+k(a3/C0y,  (1) 

where  C  is  the  confined  compressive  strength  at  the  confining  pressure  of  <r3  and  C0  is 
the  unconfined  compressive  strength  (cr3  =  0).  k  and  n  are  constants  for  a  given  rock 
type.  The  values  A:  =  2*1  and  n  =  064  for  shales  and  k  =  3-2  and  n  =  067  for 
sandstones  were  employed.  The  brittle  ductile  points  are  placed  at 

Jm  =  0-8  Pm,  (2) 

following  Mogi  (1966)  for  silicate  rocks,  where  Jm  is  the  maximum  shear  stress  and  Pm  is 
the  mean  pressure. 
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Table  1.  Average  physical  properties  of 
the  shale  and  sandstone  rocks  used  for 
calculations. 


Property 

Shale 

Sandstone 

Density  (p0) 

2-45  g/cm3 

2- 10  g/cm3 

Porosity  (</>*) 

12% 

25% 

Water  content 

0-04 

0-02 

Unconfined  compressive 
strength  (bar) 

256 

60 

Tensile  strength  (bar) 

30 

19 

Ultrasonic  pulse 
velocity  (km/sec) 

2-2-95 

14-2-7 

Young’s  modulus 
(kg/cm2) 

0-8  x  105 

0-5  x  105 

Poisson’s  ratio 

010 

0-28 

*  (p  =  ( pG  —  Po)IPq,  where  pG  is  the  grain  density  and 
p0  is  the  bulk  density  of  the  rock. 


The  average  physical  and  mechanical  properties  of  shale  and  sandstone  rocks,  used  in 
the  calculation,  are  given  in  table  1. 


4.  Calculations  and  results 

The  total  energy  of  the  device,  1 2  kt  (=  5-04  x  IQ20  ergs),  was  released  in  less  than  one 
microsecond  into  the  device  material.  The  resulting  temperature  and  pressure  were 
estimated,  following  Teller  et  al  (1968),  to  be  ~  4  million  degrees  K  and  ~  70Mbar 
respectively.  At  this  temperature,  the  device  materials  would  have  got  completely 
vaporized.  This  gas  then  expanded  rapidly  and  adiabatically  to  fill  the  emplacement 
chamber. 

4.1  Shock  vaporisation  and  shock  melting 

To  determine  the  shock-vaporised  and  shock-melted  regions  of  rock,  a  fine  mesh 
calculation  was  done.  The  energy  of  the  device  ( W  =  1 2  kt)  was  distributed  uniformly  as 
internal  energy  of  sphere  of ‘iron  gas’,  whose  volume  is  equal  to  that  of  the  emplacement 
chamber.  The  vaporisation  phase  was  found  to  be  completed  at  about  1 50  ^sec  and  the 
total  mass  of  the  rock  vaporised  was  640  tons  which  extends  up  to  a  radius  of  4T  m  from 
the  shot  point.  About  2000  tons  of  the  rock,  extending  upto  a  radial  distance  of  6-2  m, 
was  shock  melted  in  about  600  /zsec.  The  amounts  of  rock  shock  vaporised  and  shock- 
melted  are  53  tons/kt  and  160tons/kt  respectively,  which  are  lower  than  the  values  of 
70tons/kt  and  300  tons/kt  respectively  calculated  for  usa  hard  rock  events  (Butkovich 
1967). 

4.2  Shock  propagation  and  cavity  growth 

These  calculations  were  performed  using  the  so-called  ‘bubble  model’  (Butkovich 
1967).  The  total  energy  of  1 2  kt  was  redistributed  uniformly  in  the  spherical  mass  of 
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Figure  5.  Computed  wave  propagation  and  cavity  growth  in  vertical  direction  for  Pokaran 
experiment. 


vaporised  rock  of  radius  41  m.  At  this  energy  density,  the  spherical  cavity  had  uniform 
pressure  of  about  1*6  megabars.  The  equation  of  state  used  for  the  vaporised  rock  was 
the  one  for  a  ‘Si02  +  1  %  H20’  gas  mixture  taken  from  Butkovich  (1967). 

The  position  of  the  first  shock,  the  rarefaction  wave  and  the  cavity  growth  are  plotted 
in  figure  5.  The  shock  position  was  taken  to  be  the  point  where  the  magnitude  of  the 
artificial  viscosity  term  was  greatest.  At  about  14  msec  the  shock  wave  reached  the 
shale-sandstone  interface.  At  this  interface  a  shock  wave  was  induced  in  the  sandstone 
layer  while  a  backward  moving  rarefaction  wave  travelled  in  the  shale  medium*.  The 
shock  wave  in  sandstone  split  at  about  62  m  from  shot  point  into  two  waves:  a  main 
shock  wave  and  a  preceeding  elastic  precursor  which  moved  with  the  sound  velocity  in 
the  medium.  It  can  be  seen  that  the  main  shock  reached  the  free  ground  surface  at  time 
ts  ~  41  msec  and  got  reflected  from  there  as  a  rarefaction  wave  and  met  the  top  of  the 
growing  cavity  at  tg  =  72  msec. 

The  radius  of  the  cavity  at  time  tg,  in  the  vertical  direction  was  24-5  m  compared  to 
23*5  m  in  a  horizontal  direction  (all-shale  calculation).  This  difference  is  caused  by  the 
rarefaction  wave  produced  by  the  reflection  of  the  first  shock  at  the  shale-sandstone 
interface.  As  upto  time  tg  (Rodinov  et  al  1 97 1 )  the  cavity  radius  is  able  to  attain  80-90  % 


*  From  this  time  onwards,  the  one  dimensional  symmetry  of  the  problem  is  lost  and  the  calculations  using 
‘ocener’  are  strictly  valid  for  the  vertical  direction  connecting  the  shot  point  and  the  surface  ground  zero. 
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of  its  final  value,  the  final  cavity  radius  in  horizontal  direction  is  estimated  to  be 
28-29-5  m.  This  is  in  comparison  with  post-shot  measured  value  of  30  m. 

4.3  Particle  velocity  profiles 

The  particle  velocity  versus  time  at  distances  27-4  m,  55  m,  80-5  m  and  107  m  from  the 
shot  points  are  plotted  in  figure  6.  The  first  outward  going  shock  wave  (indicated  by  the 
sharp  increase  in  the  particle  velocity)  appears  at  these  distances  around  6-4,  1 6,  29  and 
41  msec  respectively.  Surface  ground  zero  begins  to  move  up  with  initial  velocity  of 
30  m/sec.  This  is  in  good  agreement  with  the  observed  velocity  (25-30  m/sec). 

The  rarefaction  wave  starts  from  the  surface  ground  zero  and  moves  towards  the 
cavity  increasing  the  particle  velocities  (indicated  by  points  A  in  figure  6).  The  plot 
shows  that  the  shale-sandstone  interface  spalled  at  around  58  msec.  A  recompaction 
wave,  which  resulted  in  further  increase  in  particle  velocities  (denoted  by  points  B), 
seems  to  be  arrested  at  the  shale-sandstone  boundary.  This  would  have  reached  the 
surface  at  much  later  times  and  its  effect  on  increasing  the  particle  velocities  (and  hence 


Figure  6.  Computed  particle  velocity  versus  time  profiles  at  various  distances  for  the 
Pokaran  pne  experiment.  A  —  spall  phase,  B  —  gas  acceleration  phase. 
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on  the  final  crater  dimensions)  would  seem  minimal.  This  calculation  seems  to  suggest 
that  the  principal  mechanism  responsible  for  crater  formation  at  Pokaran  was  spall. 

4.4  Extent  of  rock  failures  (ductile  flow  and  brittle  fracture) 

The  calculation  shows  that  all  the  rock  above  the  detonation  point  ( i.e .  in  vertical 
direction)  had  failed.  In  horizontal  direction  the  extent  of  fracturing  was  determined  by 
performing  a  calculation  in  homogeneous  shale  medium  in  which  the  free  surface  had 
been  removed  far  away  from  detonation  point.  The  maximum  extent  of  fracture  radius, 
as  indicated  by  the  crack  number  equal  to  2,  is  about  1 14  m  compared  to  the  observed 
value  of  80-100  m. 

4.5  Reasons  for  smaller  crater  in  Pokaran  experiment 

Having  described  the  early  time  phenomenology  of  the  Pokaran  experiment,  we  shall  in 
this  section  examine  the  causes  for  obtaining  a  smaller  crater  compared  to  the  cratering 
events  in  hard  rocks  in  usa.  For  this  we  will  use  the  kinetic  energy  criteria  as  given  in 
Appendix  A. 

4.5*2  Comparative  study  of  cratering  in  Pokaran  medium  and  Buckboard-basalt* — First, 
to  get  a  preliminary  idea  about  the  cratering  efficiency  of  the  Pokaran  rocks,  we 
compare  in  table  2  their  physical  properties  with  those  of  Buckboard  basalt  in  which 
usa  has  carried  out  two  cratering-type  nuclear  explosions:  Danny  Boy  (yield  =  042  kt) 
and  Sulky  (0  087  kt).  The  usa  scaled  apparent  crater  radius  and  scaled  apparent  crater 
depth  versus  scaled  depth  of  burst  curves  are  essentially  based  on  these  experiments. 
The  influence  of  the  variation  of  rock  properties  on  the  cratering  efficiency  of  a  rock  has 
been  discussed  by  Terhune  et  al  (1971)  and  Gupta  et  al  (1979b).  Based  on  that,  we  have 
included  a  comment  about  the  relative  cratering  efficiency  of  Pokaran  rocks  with 
respect  to  basalt.  We  observe  that  except  for  the  water  content  the  other  properties  will 
lead  to  lowering  the  cratering  efficiency  of  Pokaran  rocks. 


Table  2.  Comparison  of  physical  properties  of  Pokaran  rocks  with  those  of 
Buckboard-basalt. 


Property 

Shale 

Sandstone 

Basalt 

Resultant 
cratering 
efficiency  of 

Pokaran  rocks 
relative  to  basalt 

Density  (g/cm3) 

2-45 

210 

2-62 

Lower 

Porosity  ( %) 

12 

25 

3 

Lower 

Bulk  modulus  (kbar) 

62 

52 

90 

Lower 

Water  content 

004 

0-02 

0-0 

Higher 

Maximum  shear 
strength  (bar) 

358 

170 

300 

Lower 

*  The  physical  properties  for  Buckboard-basalt  have  been  taken  from  Cherry  (1967). 
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Distance  from  shot  point-m 


Figure  7.  Particle  velocity  versus  range  profiles  for  the  Pokaran  event  and  a  hypothetical 
event  in  Buckboard-basalt  medium. 


Fora  quantitative  comparison  we  performed  a  calculation  for  a  hypothetical  event  in 
which  a  device  of  12  kt  yield  is  detonated  at  107  m  in  Buckboard  basalt  medium.  Figure 
7  shows  the  particle  velocity  versus  range  profile  at  time  tg  for  Pokaran  event  and  the 
event  in  Buckboard  basalt.  In  general  the  particle  velocities  in  Pokaran  medium  are 
lower  than  those  in  Buckboard  basalt.  The  spall  velocity,  as  expected,  is  also  lower 
(30  m/sec  compared  to  36  m/sec).  Accordingly  the  cratering  efficiency  of  Pokaran  rock 
should  be  lower.  This  is  also  indicated  by  the  values  of  the  late-time  kinetic  energies 
along  the  vertical  direction.  These  are  5-7%  and  9%  respectively  in  the  two  cases. 
Following  Burton  et  al  (1975),  this  gives  us  an  effective  equivalent  yield  of  5-7/9  x  12 
=  7-6  kt  for  a  detonation  at  107  m  in  Buckboard  basalt,  which  would  give  similar  crater 
dimensions  as  the  Pokaran  experiment.  Actually  this  may  be  slightly  a  lower  estimate  of 
the  effective  yield  as  for  other  than  vertical  directions  the  shock  wave  will  traverse  more 
through  the  shale  medium  which,  as  an  all  shale  calculation  shows  will  have  more 
kinetic  energy  imparted  to  it.  For  all  shale  calculations,  the  kinetic  energy  at  time  ig  is 
6-7  %  with  corresponding  effective  yield  of  8-9  kt.  Further,  there  will  be  the  effect  due  to 
interaction  of  shock  waves  at  the  interface  of  the  shale-sandstone  medium,  which  will 
redirect  the  particle  velocity  vectors  from  their  initial  radial  directions.  This  is  not 
accounted  for  in  one-dimensional  calculations.  A  rough  estimate  of  this  puts  the 
effective  sdob  of  Pokaran  experiment  to  be  57  m/kt1/3'4. 

Considering  all  the  above  factors,  we  may  say  that  the  Pokaran  experiment  was 
equivalent  to  that  of  an  ~  8kt  detonation  at  the  same  depth  (effective  scaled  depth  of 
burst  =  58  m/kt1/3  4)  in  basalt.  It  is  known  that  at  this  range  of  scaled  depth  of  burial 
one  gets  a  retarc  or  a  marginal  crater  depending  upon  the  bulking  properties  of  the 
mound  fall-back  material.  Since  the  mound  in  Pokaran  experiment  did  not  break  and 
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also  the  post-shot  analysis  showed  that  no  significant  bulking  of  the  rock  had  taken 
place,  the  formation  of  a  marginal  crater  is  not  surprising. 

In  conclusion,  the  reason  for  obtaining  a  smaller  crater  at  Pokaran  is  due  to  the  fact 
that  a  smaller  amount  of  the  device  energy  was  converted  by  the  medium  into  kinetic 
energy.  This  is  principally  due  to  the  geological  properties  of  the  Pokaran 
medium. 


4.6  N on-venting  of  Pokaran  experiment 

According  to  the  published  radioactive  vent-fraction  fc  versus  scaled  depth  of  burial 
curve  (Lapage  1974;  Siddons  1974),  for  the  Pokaran  experiment  at  sdob  of 
5 1  -5  m/fkt)1  /3'4,  fc  is  predicted  to  be  very  small  but  to  have  a  finite  value  of 0-003,  instead 
of  the  observed  value  of  zero.  We  feel  that  this  small  discrepancy  is  because  such  scaling 
laws  ignore  the  fact  that  the  vent  fraction  depends  not  only  on  sdob  but  also  on  the  type 
of  the  rock  medium  in  which  the  experiment  is  carried  out.  Now,  as  discussed  above,  the 
physico-mechanical  properties  of  a  rock  reflect  themselves  well  in  the  amount  of  energy 
of  the  explosion  imparted  to  the  rock  as  kinetic  energy  and  the  effective  yield  of  an 
explosion  relative  to  a  standard  rock  medium  can  be  defined  as 

W^xW,  (3) 

where  x  is  the  ratio  of  the  kinetic  energies  given  to  rock  medium  and  a  standard  rock 
medium  by  explosions  of  yield  W  carried  out  at  the  same  depth  of  burst  d.  This  allows  us 
to  simply  modify  the  scaling  law  for  fc  from 

fc  =  f(d/(W)113*),  (4) 

to 

fc=f(d/(xwy3i),  (5) 

we  take 

x  =  v\]v2,  (6) 

where  vx  and  v  are  the  peak  velocities  of  surface  ground  zero  for  the  two  explosions  in  a 
given  medium  and  the  standard  medium.  For  quantitative  estimates,  we  have  again 
chosen  buckboard  Mesa  basalt  rock  in  usa  as  standard.  The  variation  of  v  with  d  for  this 
rock  type  (see  Harlan  1967)  is 

v  =  19700  (d/Wl,3)~16.  (7) 

Here  W  is  in  kilotons  and  d  is  in  meters.  For  Pokaran  event,  x  turns  out  to  be  0-5 
compared  to  10  for  basalt.  Using  this,X  predicted  for  Pokaran  event  is  <  10-6  ~  zero 
in  agreement  with  experiment. 

To  summarize,  the  non-venting  of  the  Pokaran  experiment  seems  to  have  been  due  to 
the  smaller  cratering  efficiency  of  the  shale-sandstone  medium.  Because  only  a  small 
kinetic  energy  was  imparted  to  the  mound,  it  did  not  rise  very  high  relative  to  the  depth 
of  burial.  Consequently,  the  tensile  stresses  developed  in  the  mound  were  not  enough  to 
disintegrate  it  and  cause  dynamic  venting.  Also,  because  the  mound  did  not  rise  much, 
the  minimum  thickness  of  the  overburden  layer  above  the  cavity  was  much  larger 
(~  24  m  compared  to  2  m  for  the  Danny  Boy  event  which  vented  with  fc  =  0*042)  so 
that  there  was  no  connection  of  fissures.  This  prevented  the  radioactivity  from  leaking 
into  the  atmosphere. 
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Appendix  A.  On  the  correlation  between  the  crater  dimensions  and  the  mound  kinetic 
energy  for  peaceful  underground  explosions 

The  terminal  effect  of  an  underground  peaceful  nuclear  explosion  on  a  geological 
medium  is  to  produce  a  crater  if  the  depth  of  the  burial  of  the  explosive  device  is  less 
than  about  6Gm/kt134*  (Teller  et  al  1968).  The  dimensions  of  such  a  crater,  as 
indicated  by  many  computer  code  calculations  (Terhune  et  al  1970;  Burton  et  al  1975), 
depend  on  the  motion  of  the  mound  which  in  turn  depends  on  the  fraction  of  total 
energy  of  the  explosive  converted  into  the  kinetic  energy  of  the  mound.  Any  change  in 
the  explosion  conditions  such  as  the  type  of  the  explosive,  chemical  or  nuclear,  variation 
in  the  rock  properties,  such  as  moisture  content,  compressibility,  strength  etc  or 
geological  variations  such  as  layering,  will  get  reflected  in  the  amount  of  energy 
transferred  to  the  mound  and  affect  the  crater  dimensions.  Although  the  dependence  on 
the  kinetic  energy  was  recognised  quite  some  time  ago,  no  quantitative  relation  appears 
to  have  been  deduced  between  it  and  crater  dimensions.  Here  we  attempt  to  show  that 
such  a  relation  exists. 

To  drive  the  relationship,  we  have  carried  out  a  similarity  analysis  of  the  linear  crater 
dimensions  versus  kinetic  energy  of  the  mound  following  the  procedure  given  by  Baker 
et  al  (1973).  A  linear  crater  dimension  /  (/  =  apparent  crater  radius  Ra  or  apparent  crater 
depth  Da)  produced  by  an  explosion,  beneath  the  surface  of  earth  is  at  least  a  function  of 
the  explosive  yield  ( W )  and  depth  of  burst  ( d ).  We  assume  that  the  effect  of  the 
additional  variables  like  the  type  of  explosive  source  (chemical  or  nuclear)  and 
geological  medium  (compressibility,  porosity,  strength,  water  content,  etc)  is  reflected 
in  the  kinetic  energy  ( E )  developed  in  the  mound.  This,  we  represent  by  a  function 


/  -  l(d ,  W,  £). 

It  is  then  easy  to  show  from  dimensional  analysis  that 

(Al) 

l/d  —f(E/W). 

This  relation  means  that  if  for  two  craters, 

(A2) 

E\/Wl  =  E2/W2. 

Then 

(A3) 

h/di  —  Ij/di, 

(A4) 

i.e.  the  two  craters  will  be  similar. 

Though  this  analysis  gives  the  functional  dependence  of  the  linear  crater  dimensions 
on  the  kinetic  energy,  the  form  of  the  function  is  yet  undetermined.  In  figures  Al  and 
A2,  we  have  plotted  Ra/d  versus  £/JTand  DJd  versus  E/ W respectively  for  chemical 
explosions  in  Beerpaw  shale.  The  data,  which  have  been  taken  from  Burton  et  al  (1975), 
include  the  values  of  the  quantities  Ra  and  Da  and  kinetic  energy  E ,  which  has  been 
calculated  using  computer  code  tensor  (Maenchen  and  Sack  1964).  Although  there  are 
only  four  points  in  the  plots,  the  correlation  between  RJd  and  £/Vkand  between  DJd 
and  E/Warz  clearly  brought  out  by  this  data.  A  linear  relationship  is  implied.  It  may  be 
noted  that  the  kinetic  energy  of  the  mound  is  not  an  observable  quantity  and  also  is  not 
easily  computable  as  it  requires  two-dimensional  codes  like  tensor  (Maenchen  and 


*  This  value  depends  upon  the  geological  medium. 
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Figure  A-l.  Ra/d  versus  E/W for  Beerpaw  shale. 


Figure  A-2.  Da/d  versus  E/W  for  Beerpaw  shale. 


Sack  1964).  Hence  for  practical  utilization  of  a  crater  dimension  versus  E/W  correlation, 
we  have  to  have  a  suitable  measure  for  this  kinetic  energy.  Briefly,  the  various 
equivalent  measures  adopted  for  this  mound  kinetic  energy,  as  summarized  by  Gupta  et 
al  (1979b)  are: 

(1)  velocity  profile  between  the  cavity  and  the  free  ground  surface  in  the  vertical 
direction  at  the  time  the  rarefaction  arrives  at  the  cavity  surface; 

(2)  free-field  kinetic  energy  versus  time  profile  from  a  calculation  in  which  the  free 
ground  surface  is  not  included  in  the  calculation;  and 

(3)  peak  pressure  versus  radius  profile. 
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Abstract.  Discovery  of  the  neutron  in  1932  by  Chadwick  ushered  in  a  new  era  of  scientific 
research  and  technology.  The  neutron  is  endowed  with  unique  properties  in  its  mass,  life  time, 
spin  and  magnetic  moment  etc  and  every  important  property  has  been  used  in  the  study  of 
condensed  matter,  biological  molecules,  nuclear  forces,  stellar  objects  and  other  fields. 
Neutron  has  a  wide  range  of  applications  in  power  production,  breeding  of  fissile  fuel, 
radiography,  medicine  and  others. 

Keywords.  Neutron  properties;  spallation;  plasma  focus;  breeding;  neutron  radiography. 
FACS  No.  28-40;  29-25;  52-75 


1.  Introduction 

It  is  seldom  realised  how  important  the  discovery  of  neutron  has  been  in  our 
understanding  of  nature.  Neutron  by  virtue  of  being  a  major  constituent  of  the  nucleus 
has  been  responsible  for  the  subject  of  nuclear  physics.  Neutrons  have  not  only  been  the 
key  for  nuclear  transformations,  production  of  isotopes,  production  of  nuclear  power 
and  the  like  but  are  also  fundamentally  responsible  for  the  characteristics  displayed  by 
the  sun  and  the  stars  as  we  see  them  today. 

In  India  Dr  Raja  Ramanna’s  name  is  synonymous  with  neutron  physics.  Having 
started  his  research  career  in  King’s  college  London  using  a  small  Ra-Be  neutron 
source  for  fission  physics  studies,  he  has  emerged  as  a  pioneer  in  nurturing  the  field  of 
neutron  physics  in  our  country.  He  set  the  pace  in  early  fifties  for  the  Indian  scientists  to 
do  frontline  research  in  the  then  emerging  fields  like  solid  state  physics,  nuclear  physics, 
neutron  physics  and  reactor  physics  using  neutron  as  a  research  tool.  He  headed  the 
groups  that  were  responsible  for  providing  high  intensity  neutron  sources  from 
reactors.  The  reactors  apsaka,  cirus,  pijrnima  and  latest  dhruva  have  all  been  built 
under  his  guidance.  India’s  first  peaceful  nuclear  explosion  was  also  his  brainchild. 

Neutron  plays  a  very  pivotal  role  as  a  research  tool  through  some  of  its  unique 
properties  like  charge  neutrality,  spin,  magnetic  moment  etc.  Therefore  the  first  part  of 
this  review  article  is  devoted  to  highlight  these  various  properties.  The  second  part  of 
this  review  details  the  various  techniques  employed  for  the  production  of  neutrons.  It  is 
emphasised  that  it  is  not  merely  the  intensity  alone  but  rather  combination  of  number 
and  energy  that  imparts  special  characteristics.  Two  applications  have  been  cited  at  the 
end  in  some  detail. 

This  article  is  dedicated  to  Dr  Ramanna  on  the  occasion  of  his  60th  birthday  but  for 
whose  vision  and  missionary  zeal  neutron  physics  would  not  have  taken  roots  and 
grown  to  attain  its  present  stature  in  India. 
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2.  Discovery  of  the  neutron 

Lord  Rutherford,  the  discoverer  of  the  nuclear  atom,  made  an  amazing  prophecy 
(Rutherford  1920)  by  stating 

“Under  some  conditions  it  may  be  possible  for  an  electron  to  combine  much  more 
closely  with  hydrogen  nucleus  (than  in  the  neutral  hydrogen  atom)  forming  a  kind  of 
neutral  doublet.  Such  an  atom  would  have  very  novel  properties.  Its  external  field 
would  be  practically  zero,  except  very  close  to  the  nucleus,  and  in  consequence  it  should 
be  able  to  move  freely  through  matter.  Its  presence  would  probably  be  difficult  to  detect 
by  the  spectroscope  and  may  be  impossible  to  contain  in  a  sealed  vessel.  On  the  other 
hand  it  should  enter  readily  the  structure  of  atom  and  may  either  unite  with  the  nucleus 
or  be  disintegrated  by  its  intense  field,  resulting  possibly  in  the  escape  of  a  charged 
hydrogen  atom  or  an  electron  or  both”. 

The  discovery  of  the  neutron  by  Chadwick  (1932)  twelve  years  after  the  prophecy  was 
the  culmina  tion  of  various  investigations  in  several  laboratories  in  Europe.  It  all  started 
in  1930  when  Bothe  and  Becker  (1930)  tried  to  detect  nuclear  disintegration  with  a 
needle  counter.  They  bombarded  various  light  elements  like  Li,  Be  and  B  with  alpha 
particles  from  a  Polonium  source  and  measured  the  energies  of  gamma  rays  emitted 
from  the  samples.  The  gammas  from  Li  were  ascribed  to  nuclear  excitation  by  the 
inelastic  scattering  of  the  alpha  particles  and  that  from  boron  was  attributed  to  the  fact 
that  the  residual  carbon  nucleus  is  in  an  excited  state.  Beryllium  in  particular  gave 
gammas  of  very  high  energies  of  ~  5  MeV,  and  this  was  explained  by  Bothe  and  Becker 
saying  that  the  nucleus  was  excited  by  the  capture  of  alpha  particle. 

Two  years  later,  Curie  and  Joliot  used  a  much  more  powerful  Pu-Be  source  to 
investigate  further  the  Be  +  a  reaction.  They  observed  that  the  radiation  from  Be 
ejected  protons  with  kinetic  energies  around  5  MeV,  when  the  foils  containing 
hydrogenous  material  were  inserted  between  beryllium  and  the  ionization  chamber. 
This  lead  to  a  difficult  conclusion  that  in  order  to  produce  such  energetic  compton 
recoils,  the  gammas  must  have  energies  of  the  order  of  50  MeV. 

Shortly  thereafter,  in  1932,  Chadwick  who  was  a  student  of  Lord  Rutherford, 
measured  energies  and  momenta  of  recoils  from  different  materials  using  a  pulse 
ionization  chamber  (figure  1)  and  explained  all  observations  in  terms  of  collisions, 
consistent  with  the  laws  of  conservation,  with  a  penetrating  neutral  massive  particle 
slightly  heavier  than  the  proton.  He  christened  the  new  particle  the  “neutron”  {n)  and  its 
production  by  the  bombardment  of  alpha  particles  on  beryllium  was  described  by  the 
equation 

Be9  +  He4  -»  C12  +  n. 


TO  PUMP 


Figure  1.  Experimental  set-up  used  by  Chadwick  that  led  to  the  discovery  of  the  neutron. 
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The  discovery  of  neutrons  resolved  many  puzzling  questions  vis-a-vis  nuclear  structure. 
Heisenberg  immediately  propounded  that  the  nuclei  contain  only  protons  and 
neutrons.  The  discovery  of  the  neutron  gave  birth  to  a  new  branch  of  physics  called 
neutron  physics. 


3.  Properties  of  neutrons 

3.1  Mass  of  the  neutron 

Chadwick  (1932)  himself  concluded  from  the  study  of  recoil  protons  liberated  by  the 
neutrons,  that  the  mass  of  the  neutron  is  approximately  equal  to  that  of  the  proton, 
probably  between  1-005  and  1  008.  A  more  accurate  estimation  of  the  neutron  mass 
(Chadwick  and  Goldhaber  1934)  was  made  from  the  binding  energy  of  the  deuteron. 
The  mass  of  the  deuteron  is  equal  to  the  sum  of  neutron  mass  mn  and  proton  mass  mp 
less  the  mass  defect  corresponding  to  the  binding  energy  Ed.  Thus 

mn  =  md-mp  +  Ed/c2. 

From  the  known  values  of  md(=  2-013554  amu),  mp  (=  1  007226  amu)  and  Ed 
(=  2-225  MeV),  the  neutron  mass  turns  out  to  be  1  008665  amu.  Measurement  of  the 
photon  energy  emitted  when  slow  neutrons  are  captured  by  hydrogen  lead  to  further 
improvement  in  the  neutron  mass  measurement  (Knowles  1962).  The  accepted  value 
(1982)  for  the  rest  mass  of  neutron  is 

mn  —  1-008665012  amu. 

An  interesting  experiment  to  measure  the  neutron  mass  directly  with  free  neutrons 
was  proposed  by  Stedman  ( 1 968)  and  several  others.  A  neutron  moving  with  a  velocity  v 
is  associated  with  it  a  wavelength  given  by 

X  =  h/(m„v). 

It  is  clear  from  this  relation  that  hjmn  can  be  determined  by  measuring  the  velocity  and 
wavelength  of  the  neutrons.  Time  of  flight  technique  was  proposed  to  measure  the 
velocity  and  the  wavelength  can  be  determined  by  Bragg  reflection  of  the  same  neutrons 
in  a  single  crystal. 


3.2  Charge  of  the  neutron 

There  is  an  abundance  of  evidence  that  the  electric  charge  on  the  neutron  if  any  ( qn )  is 
approximately  zero,  as  noted  by  Chadwick  himself.  Fermi  and  Marshall  (1947) 
obtained  an  upper  limit  for  the  charge  as  10“  18  e  from  neutron  scattering  experiments 
on  xenon.  Shapiro  and  Estulin  (1956)  tried  to  deflect  neutrons  using  strong  electric 
fields  but  could  not  find  any  effect.  Shull  et  al  (1967)  obtained  an  accurate  value  for  the 
ratio  of  the  neutron  charge  to  that  of  the  proton,  using  double  crystal  spectrometer,  as 

( —  1-9  ±  3-7)  10'18. 

\ 

In  a  recent  experiment  at  the  Institut  Laue-Langevin  at  Grenoble  Gohler  et  al  (1982) 
used  a  focussed  beam  of  220  m/sec  neutrons  to  pass  through  an  electric  field  of 
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5-9  kV/mm  for  a  distance  of  10  m  and  estimated  the  charge  of  the  neutron  as 
qn  =  —  (1*5  ±  2-2)  x  10-20  e, 
where  e  is  proton  charge. 

3.3  Spin  of  the  neutron 

The  deuteron  has  spin  1  and  no  orbital  angular  momentum.  Since  the  proton  has  spin  j, 
it  follows  that  the  neutron  must  have  spin  j  or  §.  Schwinger  (1937)  showed  that 
experiments  on  the  scattering  of  neutrons  by  ortho  and  parahydrogen  would  permit 
determination  of  the  neutron  spin.  Such  experiments  carried  out  by  Halpern  et  al  (1937) 
conclusively  demonstrated  that  the  neutron  spin  is  just  j. 

3.4  The  magnetic  moment 

The  first  measurement  of  magnetic  moment  of  the  neutrons  was  carried  out  by  Alverez 
and  Bloch  (1940)  using  magnetic  resonance  technique.  Cohen  et  al  (1956)  using  a 
refined  Rabi-method  obtained  a  more  accurate  value  for  the  neutron  magnetic  moment 
as  pn  =  —0-685039  gp(p.p  is  the  magnetic  moment  of  the  proton).  The  most  accurate 
experimental  value  is  due  to  Green  et  al  (1979)  obtained  at  ill.  This  experiment  has 
incorporated  several  innovations  and  improvements  like  use  of  cold  neutrons  (which 
allowed  narrower  resonance  line  widths),  guide  tubes  and  Ramsey  separated  oscillatory 
field  magnetic  resonance  technique.  The  value  for  the  neutron  magnetic  moment  is 

gn  =  -  0-68497935  gp. 


3.5  The  neutron  lifetime 

One  of  the  important  properties  of  free  neutron  is  its  instability.  Since  the  neutron 
decay  is  the  most  elementary  example  of  the  beta  decay,  measurements  of  neutron  half- 
life  are  of  special  importance,  particularly  in  the  determination  of  weak  coupling 
constants  gv  and  gA.  Neutron  lifetime  also  plays  a  crucial  role  in  astrophysics  where  it  is 
central  to  the  problem  of  helium  production  in  the  early  universe  (Taylor  1979).  The 
neutron  decays  by  weak  interaction  as 

n  ->  p  +  e  +  v 

according  to  the  radioactive  law  with  a  decay  rate 
dN(t)/dt  =  AN. 

In  order  to  measure  the  half-life  one  has  to  determine  the  absolute  number  of  neutrons 
in  a  well  defined  volume  and  obtain  dN(t)/dt  from  the  measured  rate  of  decay  particles 
either  protons  or  electrons  from  this  volume.  The  neutron  density  within  the  beam  is 
obtained  with  a  high  degree  of  precision  using  foil  (l/v  detector)  activation  technique. 
The  most  formidable  problem  in  any  neutron  lifetime  experiment  is  to  be  able  to  detect 
the  protons  or  electrons  from  the  decayed  neutrons  in  the  presence  of  high  gamma 
background. 

Robson  (1950)  made  the  first  serious  attempt  to  measure  its  half-life  accurately.  He 
was  able  to  detect  both  the  decay  particles,  the  proton  and  electron  eliminating  the 
background.  The  major  uncertainty  was  the  effective  volume  from  which  the  decayed 
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particles  emerged.  He  obtained  the  neutron  half-life  as  1 2-8  ±  2-5  min.  Sosnovsky  et  al 
(1959)  carried  out  precise  measurements  and  came  out  with  a  neutron  half-life  as 
11-7  +  03  min.  The  drawback  in  these  experiments  was  that  only  a  small  fraction  of  the 
decays  occurring  in  the  source  was  recorded  directly  since  the  solid  angle  for  the 
collection  was  small.  Christensen  et  al  (1972)  using  an  8  kG  magnetic  field  were  able  to 
record  all  the  decays.  The  magnetic  field  forces  the  decayed  electrons  leaving  the  beam 
to  spiral  about  the  field  lines  so  that  they  eventually  hit  a  large  plastic  scintillator.  This 
measurement  gave  the  half-life  value  as  10*61  ±  0T6  min.  Byrne  et  al  (1980)  using  “very 
cold  neutrons”  obtained  10-82  ±016  min  for  the  neutron  half-life.  The  important 
features  of  this  experiment  were  (1)  a  well-defined  volume  for  the  neutron  beam 
(2)  practically  zero  background,  achieved  by  trapping  the  emerging  protons  for  a 
significant  time  in  an  electromagnetic  potential  well  before  being  released  and  detected. 
Figure  2  shows  schematic  of  the  experimental  layout.  Table  1  gives  the  basic  properties 
of  the  free  neutrons. 


S3 


NEUTRON  BEAM- 
TRAP  ELECTRODE 


-PROTON  DETECTOR 
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.magnet  coils 
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Figure  2.  Experimental  layout  for  the  determination  of  neutron  lifetime. 


Table  1.  Basic  properties  of  the  free  neutrons. 


Quantity 

Value 

Mass  (mj 

1  0086650 12  (0-011  ppm) 

Charge  (qn) 

^  (—  l-5±2-2)10_2oc 

Spin  J 

1 

2 

Magnetic  moment  (pi„ ) 

-0-684979  35  (0-25  ppm) 

Halflife  (weighted  average) 

Ratio  of  weak  interaction 

(10-68  ±0-13)  min 

coupling  constants  (gA/gv) 

— 1-255  ±0-006 

Quantized  properties 

Intrinsic  parity 

+  1 

Isospin 

1 

2 

Baryon  number 

l 

Lepton  number 

0 

Strangeness 

0 

Charm 

0 

Bottomness 

0 

Topness 

0 
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4.  Neutron  sources 

Free  neutrons  do  not  occur  in  nature  because  of  their  short  lifetime  and  therefore  must 
be  produced  artificially.  Neutrons  form  a  part  of  the  atomic  nuclei  of  all  elements 
(except  hydrogen)  where  they  are  held  together  by  short  range  nuclear  forces.  The 
neutrons  can  therefore  be  separated  from  the  nuclei  if  the  nucleus  is  somehow  brought 
into  an  excited  state  whose  energy  exceeds  the  binding  energy  of  the  last  neutron. 
Excitation  of  nuclei  of  selected  isotope  is  generally  accomplished  by  bombarding  them 
with  a  particles,  protons,  deutrons  or  gamma  rays.  Table  2  gives  binding  energy  of  the 
last  neutron  for  some  important  light  nuclei. 

4.1  Neutron  production  by  charged  particle  reactions  and  (y,  n)  reactions 

Excited  compound  nuclei  after  the  capture  of  charged  particles  can  emit  neutrons.  The 
cross-sections  for  the  reaction  depend  on  the  energy  of  the  bombarding  particle  and  on 
the  value  of  the  electrostatic  barrier  of  the  nucleus.  The  cross-section  increases  as  the 
energy  of  the  charged  particle  increases  and  reaches  a  maximum  value  equal  close  to  the 
geometrical  cross-section  of  the  target  nucleus. 

4.1a  (a,  n )  reactions:  This  reaction  has  historical  importance  because  it  was  used  in 
the  discovery  of  the  neutron.  The  reaction  in  general  can  be  written  as 

0L  +  zX*-+z  +  2X^  +  n  +  Q. 

The  <2  value  is  positive  for  exothermic  reaction  and  negative  for  endothermic  reaction. 
Some  well-known  examples  are 

a  +  Be9  -►  C12  +rc  +  5-704  MeV, 
a  +  B11  ->N14  +  n  +  0-158  MeV, 
a  +  Li7  ->  B10  +n  —  2-79  MeV. 

Radioactive  (a,  n)  sources 

These  were  the  first  amongst  the  neutron  sources  and  even  today  they  are  used  widely. 
Polonium-beryllium,  radium-beryllium,  plutonium-beryllium  and  americium-beryllium 


Table  2.  Binding  energy  of  the  last  neutron  in  light  nuclei 
(Beckurts  and  Wirtz  1964). 


Binding 

Binding 

energy 

energy 

Nucleus 

(MeV) 

Nucleus 

(MeV) 

H2 

2-225 

B10 

8-440 

H3 

6-258 

B11 

11-456 

He3 

7-719 

C12 

18-720 

He4 

20-577 

C13 

4-937 

Li6 

5-663 

c14 

8-176 

Li7 

7-253 

N14 

10-553 

Be9 

1-665 

N 1 5 

10-834 

Be10 

6-814 

N16 

2-500 
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Table  3.  Characteristics  of  some  a  radioactive  and  spontaneous  fission  isotopes. 


Half-life  (years) 

Specific  activity 

Isotope 

a-dis- 

integration 

Spontaneous 

fission 

a-dis- 

integration 

(Curie/g) 

Spontaneous 

fission, 

(fissions/sec/g) 

2i°Po 

0-38 

_ 

4650 

— 

226Ra 

1600 

— 

1 

— 

239pu 

24360 

5*5  x  1015 

0-06 

1-5  x  103 

241  Am 

457-7 

2x  1014 

3 

0-3 

252Cf 

2-73 

85-5 

540 

8  x  1011 

Table  4.  Neutrons /sec  for  1  g  of  a  radioactive 
isotope  for  some  (a,  n)  sources. 


Isotope 

Target  material 

Be 

B 

F 

210Po 

1-36  x  1010 

3-35  x  109 

1-7  x  109 

226Ra 

1-70  x  107 

3-0  x  106 

3*0  x  106 

239Pu 

1-69  x  105 

4-28  x  104 

2-1  x  104 

241  Am 

1-04  x  107 

2-49  x  106 

1-35  x  106 

252Cf 

2-39  x  109 

5*16  x  108 

3-40  x  108 

are  the  important  radioactive  (a,  n)  sources.  Table  3  gives  characteristics  of  some  a 
radioactive  and  spontaneous  fission  isotopes.  Table  4  gives  maximum  specific  output  of 
neutrons  for  1  g  of  radioactive  isotope  for  some  (a,  n)  sources. 

4.1b  (d,  n)  reactions :  This  is  an  extensively  used  reaction  for  the  production  of 
neutrons.  The  general  equation  for  this  reaction  is 

d  +  zXA-^z+lXA  +  '  +  n  +  Q 

Typical  examples, 

d  +  H2  -*>  He3  +  n  +  3-26  MeV, 
d  +  H3  -►  He4  +  n  +  17*58  MeV, 
d  +  Li7  ->  Be8  +  n  +  15*02  MeV, 
d  +  C12  ->N13+rt  -0*28  MeV, 
d  +  Be9  -►  B10  +  m  +  4*36  MeV. 

Since  the  binding  energy  of  the  deuteron  is  small  (~  2*226  MeV/nucleon)  the  compound 
nucleus  formed  by  the  capture  of  deuteron  is  always  found  in  a  highly  excited  state  and 
consequently  almost  all  (d,  n)  reactions  are  exothermic.  H 2(d,  n)  He3  reaction  is  widely 
utilised  for  the  production  of  monoenergetic  neutrons  because  of  expected  good 
neutron  yields  for  deuteron  energies  even  below  1  MeV.  The  H3  ( d ,  n)  He4  is  the  best 
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among  ( d ,  n)  reactions  for  generating  monoenergetic  neutrons  with  high  positive  Q 
value. 

4.1c  (p,  n)  reactions:  Like  (d,  n)  reaction,  this  reaction  has  large  neutron  output 
compared  to  (a,  n)  reaction  due  to  lower  threshold  energies.  This  reaction  is  extremely 
popular  to  produce  monoenergetic  neutrons  with  small  proton  energies.  The  general 
reaction  is 

p  +  zX*-+z  +  \XA  +  n  +  Q- 

Generally  lithium  and  tritium  are  used  as  targets  for  this  reaction 

p  4-  Li7  -►  Be7  +  n  -  1-646  MeV, 
p  +  H3  -►  He3  +  n  —  0-764  MeV. 

Assuming  that  this  reaction  is  essentially  due  to  p  decay  the  Q  value  of  the  (p,  n)  reaction 
is 

Q  —  Ep  ~~  Qn  •> 

where  Qn  =  0-762  MeV  is  the  Q  value  of  p  decay  of  the  neutron  and  Ep  is  the  maximum 
energy  of  the  p.  Thus  all  (p,  n)  reactions  are  endothermic  if  the  target  nuclei  are  stable. 

4.  Id  (y,  n)  reactions:  The  emission  of  neutrons  by  the  interaction  of  gamma  rays  with 
nucleus  is  known  as  nuclear  photo  effect.  With  increasing  energy  more  complex 
reactions  such  as  (y,  2 n)  (y,  p)  and  (y,  pn)  etc  are  possible.  The  reaction  equation  is 

y  +  zXA-+zXA~l  +  n  +  Q. 

The  minimum  y  energy  required  for  this  reaction  to  take  place  is  equal  to  the  binding 
energy  of  the  neutron  in  the  nucleus.  The  neutrons  are  monoenergetic  if  the  y-ray  is 
monoenergetic.  The  threshold  energy  ranges  from  10  to  20  MeV.  The  energy  of  the 
neutron  can  be  calculated  fairly  accurately  using  the  relation 

^neutron  =  MR(Ey  -  Q)/(MR  +  m), 

where  MR  and  m  are  the  masses  of  target  nucleus  and  the  neutron  respectively. 

4.1e  Radioactive  (y,  n)  sources:  The  y  energy  of  most  of  radioactive  substances  rarely 
exceeds  3  MeV  and  therefore  (y,  n)  reactions  are  possible  only  in  beryllium 
(Q  —  —1-665  MeV)  and  deuterium  (Q  =  —2-225  MeV).  The  drawbacks  of  these 
sources  are  their  small  yields  and  short  half-life  of  the  emitters.  The  target  should  be 
several  centimeters  thick  to  have  good  neutron  output  since  cross-section  for  (y,  n)  is 
small.  Sb1 24-Be  neutron  source  is  most  widely  used  in  practice  due  to  the  comparatively 
large  half-life  (60-9 d)  of  antimony. 

4.2  Neutrons  from  charged  particle  accelerators 

Accelerator-based  neutron  sources  have  distinct  advantage  in  that  the  neutron  source 
strength  and  its  energy  can  be  controlled  with  ease.  Further,  the  source  can  either  be 
continuous  or  pulsed. 

Acceleration  of  charged  particles  like  protons,  deutrons  and  a’s  is  based  on  the 
interaction  of  their  charge  with  an  electric  field.  Essentially  an  accelerator  is  a  device  in 
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which  the  charged  particles  acquire  the  required  energy  by  passing  through  a  number  of 
small  potential  differences  whose  sum  is  equal  to  the  required  V. 

Accelerators  are  classified  into  different  categories  such  as  electrostatic,  high 
frequency  and  induction  accelerators  depending  on  the  way  the  electric  field  is 
generated  to  accelerate  charged  particles.  Van  de  Graaff  and  Cokroft-Walton  cascade 
generators  belong  to  the  first  type  where  the  charged  particles  acquire  energy 
determined  by  the  difference  in  potential  between  the  electrodes. 

Most  important  accelerators  like  cyclotron,  microtron,  synchrotron,  synchrocyclo¬ 
tron  and  linear  accelerators  belong  to  the  second  group.  In  these  accelerators,  the 
particles  interact  with  the  high  frequency  electric  field  or  cavity  resonator  through 
which  they  pass.  Finally  in  the  last  group,  the  particles  are  accelerated  under  the  action 
of  an  eddy  electric  field  produced  due  to  electromagnetic  induction.  Betatron  is  an 
example  of  this  type  of  accelerator. 

Van  de  Graaff  generator :  This  is  mainly  used  to  accelerate  protons  and  deuterons  upto 
few  MeV  for  production  of  monoenergetic  neutrons  using  (p,  n)  and  (d,  n)  reactions. 
Typical  beam  current  is  about  10  pA  from  continuous  mode  operation  and  is  about 
10  mA  in  the  pulsed  mode  operation. 

Cascade  generator :  This  is  similar  to  Van  de  Graaff  except  it  differs  in  generating  the 
high  voltage.  These  are  mostly  used  for  generating  small  voltages  to  produce  neutrons 
from  H 2(d,  n)He3  or  H3  (d,  n)He4  reactions. 

Cyclotron :  Protons  and  a  particles  can  be  accelerated  to  hundreds  of  MeV  in 
cyclotrons.  These  energetic  particles  are  used  to  produce  neutrons  using  (p,  n)  or  (a,  n) 
reactions. 

Other  accelerators :  Synchrocyclotron,  synchrotron  isochronous  cyclotron  (azimu¬ 
thal  variation  of  field)  and  linear  accelerators  are  basically  meant  for  getting  very  high 
energy  particles. 

Electron  accelerator :  These  accelerators  can  be  used  to  accelerate  electrons  upto 
energies  equal  to  50-100  MeV.  On  striking  a  target,  these  electrons  give  intense 
bremsstrahlung  with  a  continuous  spectrum.  The  bremsstrahlung  in  turn  produces 
neutrons  by  (y,  n)  processes.  Bombarding  a  thick  uranium  target  with  40  MeV  electrons 
of  about  1  mA  current  yields  1014  n/sec. 


4.3  Neutrons  from  research  reactors 

Research  reactors  operating  the  world  over  are  the  main  sources  of  copius  supply  of 
neutrons.  The  built-in  facilities  in  the  reactor  enable  carrying  out  a  number  of 
experiments  simultaneous  with  the  production  of  radioisotopes.  Typically  a  10  MW 
thermal  reactor  produces  about  7-5  x  1017  neutrons  per  second.  Unfortunately  only  a 
small  fraction  of  this  source  is  actually  available  to  the  experimenter.  Hence  the 
continued  endeavour  is  to  evolve  and  optimize  core  designs  that  would  maximize  the 
neutron  flux  </>  which  is  defined  as  the  number  of  neutrons  passing  through  unit  area  in 
all  directions  in  each  second.  The  physics  aspects  of  the  reactor  are  basically 
characterized  by  this  flux. 

Since  the  neutrons  in  the  reactor  have  a  broad  distribution  of  energy,  from  0  eV  to 
10  MeV,  the  flux  is  a  function  of  energy  E  of  the  neutron.  Thus  (p(E)dE  is  the  flux  of  the 
neutrons.  Three  regions  of  energies  are  identified  within  the  spectrum. 

(a)  Thermal  neutrons :  (E  <  0-2  eV).  These  neutrons  are  in  thermal  equilibrium  with 
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the  moderator  atoms  and  their  energy  distribution  follows  a  Maxwellian  form  given  by 

</,(£)d£  =  (^pG)/  £exp(-£/£r,d£> 

where  n  is  neutron  density,  k  is  Boltzmann’s  constant,  m  is  neutron  mass  and  T,  the 
temperature  of  the  medium  in  degrees  Kelvin. 

(b)  Resonance  neutrons :  (0-2  eV  <  E  <  05  MeV).  In  this  range,  the  neutron  spectrum 
is  determined  by  the  neutrons  being  slowed  down  by  elastic  collisions  with  the 
moderator  nuclei.  In  this  region  the  spectrum  follows  the  behaviour 

(constant)  d  E 
4>(E)dE  = - - - 

(c)  Fast  neutrons :  (E  >  05  MeV).  These  neutrons  have  more  or  less  the  same  energy 
distribution  as  that  of  fission  neutron  spectrum  which  is  given  by 

N(E)  %  exp  (  —  £)sin  h  (2E)112  (E  in  MeV). 

The  intensity  of  each  of  these  components  depends  upon  the  type  of  reactor,  its 
operating  power  and  the  point  inside  the  reactor.  The  major  limiting  factor  to  have 
thermal  fluxes  better  than  1015  neutrons/sec /cm2  is  the  power  density  which  is 
controlled  by  various  requirements  of  heat  transfer,  heat  transport  and  stability  of  core 
materials  under  irradiation.  Two  representative  types  of  reactors  will  now  be 
considered. 

Research  reactors  fuelled  with  natural  uranium  and  moderated  with  heavy  water  are 
in  operation  in  different  countries.  Indian  reactor  cirus,  Canadian  reactors  nrx  and 
nru  and  French  reactor  EL2  are  examples  of  this  type  of  reactors. 

The  second  type  of  reactors  are  fuelled  with  highly  enriched  uranium  and  moderated 
with  light  water.  Material  testing  reactor  of  us  and  other  swimming  pool  reactors  are 
examples  of  this  category. 

Table  5  gives  the  different  components  of  the  flux  at  the  core  centre  for  these  reactor 
types  operating  at  10  MW. 

A  simple  expression  (Spinrad  1966)  can  be  derived  which  relates  the  core  average  flux 
to  the  core  parameters  like  critical  mass,  core  volume  and  power  density  to  select  the 
type  of  reactor  to  have  increased  fluxes. 

The  core  average  flux  </>,  is  related  to  the  power  density  Pd(MW/l)  and  the 
macroscopic  fission  cross  section  Zy  as 

^  Pd  x  3-3  x  1013 


Table  5.  Flux  components  in  typical  research  reac¬ 
tors  operating  at  10  MW. 


Type 

(p/ 

^epi 

4>th 

NRX 

2  x  1012 

1012 

3  x  1015 

MTR 

1  x  1014 

1013 

1  x  1014 

DHRUVA 

5  x  102 

2x  1013 
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The  Ej  is  further  related  to  critical  masses  and  the  core  volume  V  as 

where  N0  is  Avagadro’s  number,  of  is  the  microscopic  fission  cross-section  and  A  is  the 
molar  mass  of  the  fuel.  Substituting  this  in  the  first  equation 

_  (3-3  x  10 13)A  P 
^  ~  N0of  M  ’ 

where  P  is  total  power.  Thus  it  is  clear  from  these  equations  that  to  get  high  flux,  one 
should  attempt  to  have  a  highly  dilute  core  (M/V  is  small)  or  a  small  critical  mass 
depending  on  whether  power  density  or  power  is  the  limiting  factor. 

The  beam  tubes 

Beam  tubes  are  provided  to  make  neutron  beams  available  outside  the  reactor  for 
carrying  out  the  experiments.  These  tubes  are  normally  cylindrical  aluminium  or 
zircalloy  pipes  which  penetrate  the  shield  in  the  horizontal  direction  to  reach  either  the 
core  or  reflector  or  any  other  location  to  satisfy  the  experimental  requirement.  The 
neutron  current  density  that  can  be  obtained  from  such  a  tube  can  be  calculated  as 
follows.  If  A  is  the  front  surface  area  of  the  beam  hole  and  (p  is  the  flux  at  that  surface, 
then  the  number  of  neutrons  entering  the  beam  per  second  is  (pA/ 4.  If  /  is  the  length  of 
the  beam  tube,  then  the  neutron  current  density  J  at  the  tube  exist  is 

J  =  (pA/Anl2. 

Beam  tubes  are  normally  positioned  tangential  to  the  core  to  reduce  the  gamma  ray 
background.  Pure  thermal  beams  are  extracted  from  the  thermal  column.  For  getting 
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Figure  3.  Schematic  layout  of  the  various  neutron  beam  tubes  in  the  reactor  Dhruva. 


270 


P  K  Iyengar  and  K  Subba  Rao 


epithermal  beams  the  tangential  tubes  are  located  nearer  the  core  and  cadmium  filters 
may  be  used  to  suppress  the  thermal  neutrons  present.  Gamma  ray  intensity  can  also  be 
reduced  with  lead  filters.  Figure  3  shows  the  various  beam  tube  locations  and  the 
proposed  experiments  in  the  reactor  Dhruva  which  is  undergoing  commissioning  tests 
at  barc. 

4.4  Spallation  neutron  sources 

As  early  as  in  1947  E  O  Lawrence  conceived  the  idea  of  using  energetic  charged 
particles  (from  accelerators)  in  conjunction  with  a  target  to  produce  neutrons.  The  next 
important  milestone  in  this  direction  was  a  detailed  design  study  by  Wilson  et  al  (1965) 
of  aecl,  Canada,  of  an  intense  neutron  generator  (ing)  in  the  early  1960s.  Subsequently, 
different  laboratories  including  bnl,  ornl  and  lasl  have  undertaken  programmes  in 
this  challenging  field. 

For  all  practical  purposes  spallation  reactions  are  inelastic  nuclear  reactions  in  which 
one  of  the  two  collision  partners  is  a  complex  nucleus  (target)  and  the  energy  available  is 
several  times  the  interaction  energy  between  the  nucleons  in  the  nucleus.  Thus,  if  the 
incident  energy  exceeds  something  like  50-100  MeV  per  nucleon,  it  is  referred  to  as  a 
spallation  reaction.  Spallation  reactions  are  described  in  terms  of  the  two-step  model 
suggested  by  Serber  (1947).  In  the  first  step  the  incident  high  energy  particle  enters  the 
target  nucleus  and  interacts  with  some  of  the  nucleons  in  the  target  in  what  is  known  as 
an  intranuclear  cascade.  In  this  process  few  nucleons  will  be  knocked  out  of  the  nucleus. 
The  excited  residual  nucleus  deexcites  in  the  second  step  of  this  model  emitting  a 
number  of  single  nucleons.  The  first  stage  is  known  as  ‘fast’  stage,  knock  on  stage  or 
cascade  stage.  Similarly  the  second  stage  is  referred  to  as  slow  stage,  deexcitation  stage 
or  evaporation  stage. 

The  yield  of  such  spallation  neutrons  is  a  function  of:  (a)  incident  particle  type  and 
energy  (b)  target  material  and  its  size  and  geometry.  Figure  4  shows  some  results  of 
experiments  conducted  with  the  protons  from  the  3  GeV  cosmotron  at  bnl.  It  can  be 
seen  that  the  neutron  yield  from  U  targets  is  twice  that  from  lead  target.  This  is  simply 
due  to  fast  fission  effect  in  238U. 

The  neutron  yield  calculations  have  been  carried  out  by  Monte-Carlo  code  called 
hetc  (high  engergy  transport  calculations)  (Chandler  1972)  developed  at  ornl.  This 


Figure  4.  Neutron  yields  obtained  by  bombardment  of  heavy  targets  with  high-energy 
protons. 
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code  is  a  high  energy  version  of  an  earlier  code  designated  nmtc  (nuclear  meson 
transport  code)  (Coleman  1970).  Detailed  system  analysis  carried  out  at  bnl  and  aecl 
indicate  that:  (i)  the  optimum  proton  energy  is  likely  to  be  between  800  MeV  and 
1  GeV  (ii)  the  optimum  beam  current  is  about  300  mA 
Linear  accelerators  capable  of  accelerating  a  steady  continuous  beam  of  protons  at  a 
current  of  ~  100  mA  are  available  today.  Synchrotron  is  another  potential  candidate 
for  getting  high  energy  charged  particle  beams.  Table  6  gives  a  number  of  spallation 
neutron  source  facilities. 

4.5  Dense  plasma  focus  as  a  neutron  source 

The  dense  plasma  focus  (Mather  1964;  Fillipov  1962;  Imshennik  1973)  is  an  intense 
source  of  monoenergetic  neutrons.  Essentially  this  is  a  fast  dynamic  Z-pinch  in  which 
stored  magnetic  energy  is  rapidly  converted  into  plasma  energy  and  then  compressed 
by  its  self-magnetic  field.  The  device  shown  in  figure  5  consists  of  two  cylindrical 
electrodes  in  a  coaxial  configuration  and  a  capacitor  bank.  The  space  between  the 
electrodes  is  filled  with  deuterium  gas  at  a  pressure  of  few  torr.  The  electrodes  are 
connected  to  a  high  energy  (1  kJ  to  1  MJ),  high  voltage  (20  to  80  kV),  low  inductance 
( ~  40  nH)  capacitor  bank  through  a  spark  gap.  As  soon  as  the  spark  gap  is  closed,  the 
capacitor  bank  is  discharged  and  the  gas  breaks  down  at  the  insulator,  allowing  a  radial 


Table  6.  Spallation  neutron  source  facilities. 


Name 

LAMPF 

SIN 

TRIUMF 

ING 

RUTH.  LAB. 

IPNS-I 

KEK 

Location 

U.S.A. 

Swiss 

Canada 

Canada 

UK. 

USA. 

Japan 

accr. 

Linear 

s.o.c. 

Cyclotron 

Linear 

Sync 

His 

Sync 

P  energy 

800  MeV 

590  MeV 

500  MeV 

1  GeV 

800  MeV 

600  MeV 

500  MeV 

Avg.  current 

1  mA 

100 /iA 

100  fiA 

65  mA 

200M 

— 

1-5  fiA 

Target 

W 

Pb,  Bi 

Molten  lead 

Pb-Bi 

(liquid) 

DEP,  U 

238u 

DEPT.  U 

N.  yield/s 

2  x  1016  n/sec 

— 

1016  n/sec 

1019  n/sec 

4  x  1016  n/sec 

2  x  1015  n/sec 

1-8  x  1014 

Pulse  rate 

120 

Con/ pulse 

— 

C.W. 

50 

45 

20 

Power  cons 

27  MW 

6-8  MW 

5  6  MW 

100  MW 

— 

— 

— 

Target  heat 

200  kW 

45  kW 

38  kW 

38  MW 

450  kW 

20  kW 

2  kW 

Figure  5.  Schematic  of  a  plasma  focus  discharge  showing  initial  and  final  phases. 
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current  to  flow  between  the  electrodes.  This  radial  current  produces  an  azimuthal 
magnetic  field.  The  combination  of  the  azimuthal  field  with  a  radial  current  produces 
J  x  B  force  on  the  ions  away  from  the  insulator.  Thus  as  the  current  continues  to  flow 
the  plasma  front  is  accelerated  along  the  z  axis,  ionizing  the  neutral  gas  it  encounters.  As 
soon  as  the  plasma  front  reaches  the  end  of  the  electrode,  the  axial  momentum  of  the 
plasma  causes  the  front  to  fold  over  the  end  and  radially  imploding  towards  the  axis  of 
central  electrode  creating  a  hot  dense  plasma  focus  typically  about  2  mm  in  dia  and  10 
to  30  mm  long.  The  discharge  current  at  the  time  of  focus  formation  can  be  as  high  as 
several  hundred  kiloamperes  to  a  few  mega  amperes.  Intense  bursts  of  electrons, 
neutrons,  ions,  soft  x-rays  (<  lOkeV)  and  hard  x-rays  (>  30keV)  are  observed  to 
emanate  from  the  focus. 

A  unique  feature  of  plasma  devices  is,  irrespective  of  differences  in  design,  the 
neutron  yield  N  of  all  optimised  plasma  focus  devices  is  found  to  obey  a  universal 
scaling  law,  namely  that  the  yield  is  found  to  scale  as  the  fourth  power  of  peak  discharge 
current  (I)  or  as  the  square  of  the  capacitor  bank  energy  ( E ).  Neutron  yield  (Conrad 
1983)  as  function  of  plasma  current  for  different  focusses  of  various  international 
laboratories  including  barc  (Shyam  and  Srinivasan  1978)  is  shown  in  figure  6.  During 
the  1970s  plasma  focus  devices  have  produced  about  101 2  neutrons  per  discharge  from 
{D-D)  reactions.  With  the  present  state  of  art  1014  neutrons  per  discharge  can  be 
achieved  by  dissipating  about  300  kJ  of  stored  energy. 

4.6  Classification  of  neutrons  on  the  basis  of  energy 

Neutrons  are  classified  as  ultra  cold  neutron  (ucn),  very  cold  neutrons  (vcn),  cold 
neutrons  (cn),  thermal  neutrons,  epithermal  neutrons  and  fast  neutrons  depending  on 
their  energy  (or  wavelength).  Since  each  class  of  neutrons  has  a  role  to  play  in  the  basic 
research,  one  has  to  tailor  the  easily  available  neutrons  of  a  given  energy.  Of  all  the 
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Figure  6.  Neutron  yield  as  a  function  of  plasma  current  for  different  foci. 
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neutron  sources  ucn  and  vcn  beams  are  difficult  to  get  with  good  intensities. 

A  very  useful  relation  between  neutron  velocity  V  (m/sec),  de  Broglie  wavelength  (A) 
and  the  neutron  energy  E  (eV)  is  given  by 

E  (eV)  =  5-2  x  10-9  V2  (m/sec), 

=  8*2  x  10-2  A-2  (A). 

Ultra  cold  neutrons :  Neutrons  entering  a  solid  material  from  vacuum  with  energy  E 
less  than  L/eff  (called  the  locally  averaged  Fermi  scattering  potential  for  the  material), 
with  a  corresponding  velocity  less  than  vc  or  wavelength  greater  than  Xc  will  be  reflected 
even  at  normal  incidence.  Such  neutrons  are  called  ultra  cold  neutrons  (  ucn).  Typically 
for  ucn  E  ^  24  x  10“ 7  eV,  v  ^  6-8  m/sec  and  2  ^  580  A. 

Very  Cold  Neutrons :  Neutrons  with  the  velocity  6-8  m/sec  ^  v  ^  100  m/sec  are 
referred  to  as  very  cold  neutrons.  These  neutrons  are  reflected  at  angles  <  90°.  The 
corresponding  energy  and  wavelengths  are 

24  x  10' 7  eV  ^  E  ^  5-2  x  10~4  eV, 

580  A  >  2  ^  12-6  A. 

Cold  neutrons :  These  neutrons  are  reflected  at  very  small  angles  of  incidence  and  have 
velocities  around  800  m/sec  with  a  corresponding  energy  of  0  0033  eV  and  a 
wavelength  of  5  A. 

Although  ucn’s,  vcn’s  and  cn’s  exist  as  the  low  energy  fraction  of  the  Maxwell- 
Boltzmann  distribution,  it  is  almost  impossible  to  extract  them  directly.  So  it  is  normal 
practice  to  generate  these  sources  from  a  fast  source  by  down  scattering  in  a  good 
converter  or  cold  source  at  appropriate  temperature. 

Thermal  neutrons :  Neutrons  which  are  in  thermal  equilibrium  with  the  surrounding 
medium  and  follow  Maxwellian  distribution  for  neutron  density  with  energy. 

Epithermal  or  resonance  neutrons :  Neutrons  in  the  energy  range  from  about  02  eV  to 
500  keV  belong  to  this  category.  The  spectrum  is  determined  by  the  neutrons  being 
slowed  down  by  elastic  collisions  with  the  moderator  nuclei. 

Fast  neutrons:  Neutrons  with  energy  from  500  keV  to  20  MeV. 

Super  fast  neutrons:  Neutrons  possessing  energy  greater  than  20  MeV. 


5.  Applications 

The  important  properties  of  the  neutron  are  its  mass,  charge  neutrality,  spin,  magnetic 
moment  and  lifetime  against  /?  decay  etc.  These  unique  properties  have  made  the 
neutron  an  ideal  tool  for  the  scientist  to  probe  the  unknown  mysteries  of  the  nature. 
The  ever  expanding  research  areas  where  the  neutron  is  playing  a  dominating  role 
encompasses  the  physical  sciences,  biological  sciences,  medical  sciences,  earth  sciences, 
agricultural  sciences  not  to  mention  its  important  role  in  nuclear  power  production  and 
other  industrial  applications  (Schofield  1982). 

Exhaustive  research  work  over  the  years  on  structural  studies  and  dynamical 
properties  of  condensed  matter  using  neutrons  has  been  pursued  at  barc  (Iyengar 
1973).  Study  of  properties  of  magnetic  materials  using  the  inherent  magnetic  moment 
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of  the  neutron  (Satya  Murthy  and  Madhav  Rao  1981),  pr  oduction  and  applications  of 
polarised  neutrons  (Satya  Murthy  and  Madhav  Rao  1984)  will  also  be  not  discussed 
here. 

One  of  the  most  outstanding  applications  of  the  neutron  is  in  power  production 
through  fission  chain  reactions  in  a  reactor.  The  behaviour  of  the  neutron  population  in 
such  a  reactor  can  be  obtained  by  solving  the  Boltzmann  neutron  transport  equation  in 
seven  variables.  The  input  to  solve  the  transport  equation  is  the  neutron  cross-section 
data.  Ramanna  et  al  (1956)  developed  the  pulsed  neutron  technique  in  our  country  to 
generate  thermal  cross-section  data  for  moderators  and  coolants.  The  impact  of  nuclear 
research  on  the  future  technology  of  nuclear  power  was  discussed  by  Iyengar  (1979).  In 
view  of  the  availability  of  excellent  text  books  (Bell  and  Glasstone  1970)  we  will  not 
discuss  this  topic  also. 

5.1  Sub- Lawson  fusion  systems  for  power  production  and  breeding 

In  all  likelihood,  the  first  generation  commercial  fusion  reactors  are  to  be  fuelled  with  a 
mixture  of  deuterium  (D)  and  tritium  (T)  (Hirsch  1975;  Ramanna  1984)  wherein  about 
14*  1  MeV  out  of  1 16  MeV  energy  released  in  the  fusion  reaction  will  be  carried  away  by 
the  neutrons  liberated  in  the  reactions.  The  tokamak  and  the  tandem  mirror  are  the  top 
contenders  in  the  field  (Ramanna  1984).  The  plasma  zone  of  any  such  device  is 
surrounded  by  a  medium,  known  as  blanket,  wherein  the  14  MeV  neutrons  leaking  out 
of  the  plasma  zone  deposit  their  energy  for  the  subsequent  generation  of  power.  A 
conceptual  design  of  a  fusion  reactor  based  on  a  magnetic  confinement  is  shown  in 
figure  7.  For  a  pure  fusion  system  to  be  economical,  it  is  expected  that  the  power 
amplification  factor  Q  (fusion  output  power/power  in  the  plasma)  should  be  of  the 
order  of  10  (Dolan  1982).  Such  high  Q  values  are  unlikely  to  be  achieved  in  the  near 
future  without  further  technological  breakthroughs. 

The  hybrid  concept  (Lidsky  1975)  in  which  fusion  and  fission  systems  are  coupled 
exploits  the  fact  that  fusion  reactions  are  neutron  rich  but  energy  poor  while  fissions  are 
energy  rich  but  neutron  poor.  The  two  hybrid  concepts  that  are  vigourously  pursued 
are  (Maniscalce  1981):  (a)  hybrid  blankets  or  fast  fission  blankets  and  (b)  symbiotic 
blankets  or  suppressed-fission  blankets.  These  two  concepts  are  shown  schematically  in 
figure  8. 


Figure  7.  Conceptual  design  of  a  fusion  reactor  based  on  magnetic  confinement. 
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(b) 


Figure  8.  Schematic  of  hybrid  and  symbiotic  blankets. 


in  the  fast  fission  blanket  system,  the  D-T  fusion  source  is  surrounded  by  a  fertile 
material  blanket.  The  fusion  neutrons  cause  fast  fissions  in  the  fertile  material,  thereby 
amplifying  the  fusion  energy  and  also  multiplying  the  fusion  neutrons.  The  fission 
energy  will  be  utilized  to  generate  power  just  like  in  any  nuclear  power  reactor.  One  of 
the  multiplied  neutrons  is  needed  to  breed  tritium  from  lithium  situated  in  the  blanket 
and  the  remaining  are  available  for  breeding  fissile  fuel  from  fertile  materials  like  232Th 
and  238 U. 

Suppressed  fission  blanket  concept  is  essentially  meant  for  producing  fissile  fuel  from 
fertile  materials.  An  additional  non-fission  neutron  multiplying  zone  such  as  BeO,  lead 
etc  is  placed  between  the  D-T  fusion  source  and  the  fertile  blanket  (Nargundkar  et  al 
1984),  The  neutron  multiplier  also  moderates  the  14  MeV  fusion  neutrons  to  below 
fission  threshold  energy  (1.5  MeV)  before  they  reach  the  fertile  zone,  thus  suppressing 
fissions  in  the  blanket  while  breeding  the  fissile  material. 

Although  the  basic  motivation  for  developing  a  fusion  breeder  is  essentially  the  same 
as  for  the  fission  breeder  i.e.  to  exploit  the  enormous  energy  potential  of  the  world’s 
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abundant  Th-232  and  U-238  natural  resources,  there  are  important  differences  that 
make  the  fusion  breeder  a  better  choice.  The  following  advantages  are  noteworthy: 
(i)  Fusion  breeder  needs  no  fissile  inventory,  (ii)  Power  densities  are  10  to  200  times 
less  than  those  in  the  fission  breeder,  (iii)  The  fuel  production  and  energy  production 
can  be  separated,  (iv)  Fusion  system  can  produce  many  times  (~30)  more  net  fissile 
product  per  unit  power  than  a  fast  breeder,  (v)  Since  enormous  amounts  of  fissile  fuel 
can  be  produced  from  fusion  breeder,  many  fission  reactor  concepts  can  be  adopted, 
not  necessarily  the  fast  breeders,  (vi)  Fusion  neutrons  can  also  be  used  to  transmute  the 
longlived  radioactive  fission  waste  products  to  less  hazardous  and  toxic  substances 
with  reduced  half-lives.  This  solves  the  vexing  problem  faced  by  the  fission  reactors, 
(vii)  There  are  no  safety  problems  since  power  is  generated  in  a  subcritical  system. 

5.2  Neutron  radiography 

Neutron  radiography,  a  relatively  new  and  powerful  technique  of  producing  images  of 
objects,  is  finding  applications  in  varied  and  diverse  fields  such  as  engineering, 
metallurgy,  reactor  technology,  aerospace,  medicine,  ordnance  and  electronics  etc. 
Hitherto  the  most  established  technique  of  flaw  detection  in  industrial  products  was 
through  x-ray  radiography.  However  this  method  has  limitations  and  fails  to  give 
required  information  under  certain  situations  like  locating  light  material  behind  or 
within  heavy  materials,  testing  of  radioactive  parts  or  irradiated  fuel  assemblies  of 
power  reactors,  detection  of  hydrogeneous  inclusions  in  materials  and  inspection  of 
thick  plates  of  heavy  materials.  In  all  these  situations  neutron  radiography  can  be  used 
successfully  because  the  way  neutrons  interact  with  matter  is  totally  different  from  the 
way  x-rays  interact.  X-rays  interact  with  the  orbital  electrons  only  and  hence  the 
attentuation  coefficient  increases  with  increasing  Z  unlike  the  neutrons  which  interact 
with  the  nucleus  of  the  atom.  Another  important  difference  is  the  ability  of  the  neutrons 
to  distinguish  between  the  isotopes  of  same  element  which  is  impossible  with  x-rays. 

The  technique  of  neutron  radiography  involves  the  placement  of  the  sample  under 
test  in  the  path  of  a  well-collimated  beam  of  thermal  neutrons  and  studying  the 
transmitted  beam  modified  by  the  attenuations  within  the  sample.  Since  the  photo¬ 
sensitivity  of  x-ray  films  to  neutrons  is  very  low,  the  transmitted  neutron  beam  is  made 
to  fall  on  a  suitable  converter  screen  which  in  turn  emits  photographically  sensitive 
radiations.  These  radiations  finally  expose  a  conventional  x-ray  film  to  give  the  required 
radiograph. 

In  the  first  method,  the  converter  screen  along  with  the  x-ray  film  is  placed  behind 
the  object  under  test  and  exposed  sufficiently  long  to  get  the  image.  The  converter 
screens  are  thin  sheets  of  Rh,  Gd,  Cd,  Sm  or  a  suitable  scintillator  loaded  with  Li-6  or  B- 
10.  This  method  has  some  drawbacks  if  gamma  ray  background  is  present  in  the 
neutron  beam  or  the  specimen  itself  is  radioactive.  The  second  technique  called  the 
transfer  method  circumvents  this  problem.  In  this,  a  converter  screen  alone  is  exposed 
to  the  transmitted  neutron  beam  and  the  screen  becomes  radioactive.  The  latent  image 
of  the  specimen  formed  on  the  screen  is  then  transferred  to  the  photographic  film  in  a 
separate  place  free  from  any  background.  Suitable  converter  materials,  keeping  the 
half-life  in  mind  are  In,  Dy  and  Au. 

The  third  technique  of  recent  origin  uses  Li-6,  B-10  or  U-235  loaded  screens  to 
convert  the  thermal  neutrons  into  alpha  particles  or  fissions  fragments.  These  charged 
particles  fall  on  a  special  plastic  material  causing  radiation  damage  along  their  paths. 
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The  radiograph  is  obtained  after  etching  the  plastic  sheet  in  a  chemical  solution.  The 
advantages  of  this  method  are  (i)  totally  gamma  insensitive,  (ii)  no  limitation  on  time 
of  exposure  and  (iii)  elimination  of  use  of  x-ray  films. 

In  addition  to  what  has  been  mentioned  some  specific  applications  in  nuclear 
technology  are:  (a)  testing  of  unirradiated  nuclear  fuel  assembly  to  give  qualitative 
information  on  assembly,  voids,  foreign  materials  and  quantitative  information  on 
homogeneity,  average  density  etc.  (b)  testing  of  irradiated  fuel  to  give  information  on 
leaks,  densification,  swelling,  migration  and  disintegration  etc. 

A  dedicated  neutron  source  reactor  specially  meant  for  neutron  radiography 
applications  is  being  set  up  at  the  Reactor  Research  Centre,  Kalpakkam  (Srinivasan  et 
al  1979)  for  testing  of  fuel  assemblies  of  fast  breeder  test  reactor  and  for  other  related 
applications. 


6.  Conclusion 

Neutrons  by  virtue  of  their  unique  and  unparalleled  properties  have  provided  scientists 
and  engineers  with  a  tool  to  unravel  the  mysteries  of  nature.  Every  important  property 
of  the  neutron  has  been  exploited  by  the  researchers  to  help  probe  into  the  fundamental 
properties  right  from  understanding  the  nature  underlying  the  nuclear  forces  to  the 
formation  and  the  structure  of  the  stars.  With  further  breakthroughs  in  technology 
more  and  more  intense  neutron  sources  tailored  to  specific  needs  will  become 
increasingly  available  that  would  boost  the  neutron  utility  further  both  in  the  fields  of 
basic  and  applied  sciences.  Neutrons  have  already  contributed  in  a  large  measure  to 
global  human  welfare  and  would  undoubtedly  continue  to  do  so  in  an  ever  increasing 
manner  in  years  to  come.  It  is  no  prophecy  therefore  to  predict  that  the  neutron  which 
celebrated  its  golden  jubilee  in  the  year  1982  will  have  many  more  jubilees  in  store  in  the 
future. 
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Abstract.  The  advent  of  nuclear  reactors  ushered  in  an  era  of  increasing  number  of  sources 
of  ionizing  radiations.  However,  the  potential  of  ionizing  radiations  to  cause  harmful  effects 
was  recognized  soon  after  the  discovery  of  x-rays  and  radioactivity  i.e.  long  before  the  building 
of  nuclear  reactors.  Therefore,  protection  against  ionizing  radiations  has  been  of  paramount 
concern  and  has  guided  the  development  of  atomic  energy  and  related  fields.  The  advances  in 
technology  in  general  resulted  in  an  increase  in  accidents  causing  injury  and  death.  It  was 
realised  that  even  medicines,  food  additives  and  a  host  of  other  substances  of  daily  use  had 
injurious  side  effects.  Smoking  was  found  to  be  extremely  harmful.  From  these  emerged  the 
concepts  of  quantitative  and  relative  risks.  This  article  discusses  briefly  the  concept  of  risk  vis- 
a-vis  ionizing  radiations  and  approaches  to  protection  against  them. 

Keywords.  Risk  evaluation;  ionizing  radiation;  radio  therapy;  cancer  induction;  somatic 
effects;  diagnostic  radiology;  radiation  protection . 

PACS  No.  28  SO;  87  60 

1.  Introduction 

X-rays  and  radioactive  emanations,  both  were  in  common  use  soon  after  their 
discoveries.  That  these  radiations  might  be  harmful  at  high  doses  was  also  quickly 
realized.  Voluntary  individual  and  coordinated  efforts  towards  protection  were  taken 
up  early.  Scientific  bodies  in  countries  like  Germany  (1913),  uk  (1915)  and  the  usa 
(1916)  were  reviewing  the  requirement  of  their  safe  use.  In  1925  the  International 
Congress  of  Radiology  was  formed  and  under  its  patronage  were  born  the 
International  Commission  on  Radiation  Units  and  Measurements  (icru)  in  1925  and 
the  International  Commission  on  Radiological  Protection  (icrp)  in  1928. 

The  icrp  issued  its  first  recommendations  in  1928  itself  and  the  latest  major  revision 
came  in  1977  (icrp  1977a).  While  the  presence  of  radiations  and  radioactivity  has 
tremendously  proliferated  during  the  90  years  since  their  discovery  considerable 
information  and  knowledge  has  also  become  available  concerning  their  interactions 
with  and  the  effects  on  the  living  systems.  More  is  probably  known  about  the 
consequences  of  exposure  to  ionizing  radiations  than  any  other  environmental  agent  or 
occupational  hazard.  Paradoxically  greater  knowledge  about  the  consequences  of 
radiation  exposure  has  led  to  its  becoming  an  issue  of  serious  concern  to  society 
(Pochin  1980;  Goldman  1982).  Partly  this  is  due  to  the  growing  awareness  of  risks 
encountered  in  day-to-day  experience:  viz  that  medicines  have  side  effects,  food  colours 
may  be  harmful,  industrial  effluents  are  hazardous,  pesticides  leave  damaging  residues 
and  a  host  of  similar  other  findings,  besides  the  usual  accidents,  house  collapses  and 
fires,  etc.  Detailed  studies  of  occupational  mortality  risks  have  been  carried  out  with  a 
view  to  have  the  right  perspective  of  risks  in  different  occupations  (ICRP  1977b;  Cohen 
1981). 
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2.  Evolution  of  the  concept  of  risk 

Fora  long  time  many  scientists  believed  in  a  threshold  level  of  radiation  exposure  below 
which  no  harm  would  occur  and  which  was,  therefore,  thought  to  be  a  safe  level.  But 
data  have  been  accumulating  over  the  years  which  show  that  any  exposure,  however 
small,  may  not  be  considered  so  low  that  there  was  no  risk  due  to  it.  It,  therefore, 
became  essential  to  make  a  quantitative  assessment  of  the  amount  of  risk  or  the  degree 
of  safety  ensured  at  the  prescribed  level  of  dose  limits.  But  this  immediately  leads  to 
questions  such  as  “what  is  the  amount  of  risk  which  is  acceptable”,  “how  safe  is  safe 
enough?”  (Fischhoff  et  al  1979;  Watson  1981;  Darby  and  Keeney  1981).  Answers  to 
these  questions  involve  not  only  technical  but  economic  as  well  as  social  aspects. 
Equally  important  are  individual  and  collective  public  responses  to  risk.  Perception  of 
risk  is  most  likely  to  have  variants  such  as  (i)  economic  constraints  (ii)  bias  (iii)  strong 
personal  beliefs  and  (iv)  the  format  of  presentation.  It  is,  therefore,  very  important  to 
place  risks  in  proper  perspective.  This  can  be  done  by  placing  them  in  a  comparative 
order  with  other  hazards  or  different  sources  of  the  same  hazard.  Ionizing  radiations, 
for  example,  are  encountered  as  natural  background,  in  the  environment,  medicinally, 
occupationally  and  so  on.  This  approach  has  led  to  the  realisation  that  radon  gas 
emanating  from  construction  materials  and  accumulating  in  closed  or  poorly  ventilated 
buildings  is  a  major  source  of  public  exposures.  The  acceptability  of  risk  is  associated 
with  the  benefits  derived  from  the  source  of  the  risk.  It  also  follows  that  the  acceptable 
risk  would  be  the  risk  associated  with  the  best  of  the  available  alternatives. 

There  are  various  approaches  to  determining  the  best  alternative,  but  four  methods 
frequently  employed  are  cost-benefit  analysis,  revealed  preferences,  expressed  pre¬ 
ferences  and  natural  standards  (icrp  1982;  Starr  et  al  1976).  The  most  used  index  of  risk 
is  the  fatality  rate  both  for  natural  disasters  as  well  as  human  activities.  It  is  observed 
that  risks  of  the  order  of  10~3  fatalities  per  person  per  year  are  non-existent  and  are 
obviously  in  the  category  of  unacceptable.  Risks  of  the  order  of  10“  6  per  year,  on  the 
other  hand,  do  not  cause  any  anxiety  to  the  average  person  and  are  hence  acceptable. 
For  risk  levels  between  these  limits,  society  spends  money  to  reduce  them,  or  pays 
compensation  or  derives  commensurate  benefits  to  put  up  with  the  inconvenience.  Starr 
(1969, 1981)  who  studied  these  rates  arrived  at  a  socie tally  acceptable  and  unacceptable 
benefit  risk  pattern  of  involuntary  exposures  shown  in  figure  1.  He  also  analysed  the 


Risk  (death  rate  per  year  per  million  population)  Class  of  Risk 


Figure  1.  Benefit-risk  pattern  of  involuntary  exposure  (Source:  Starr  1981). 
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AVERAGE  ANNUAL  BENEFIT/PERSON  INVOLVED  (DOLLARS) 

Figure  2.  A  comparison  of  risk  and  benefit  to  us  society  from  various  sources  (Source:  Starr 
1970) 

voluntary  and  involuntary  aspects  of  risk,  depicted  in  figure  2,  and  derived  what  might 
be  called  the  “laws  of  acceptable  risk”  viz 

(i)  the  acceptability  of  risk  is  roughly  proportional  to  the  third  power  of  the  benefits; 

(ii)  there  seems  to  be  a  willingness  on  the  part  of  the  public  to  accept  risks  from 
voluntary  activities,  roughly  a  thousand  times  greater  than  it  would  tolerate  from 
involuntary  activities; 

(iii)  there  is  an  inverse  relationship  between  the  acceptable  level  of  risk  and  the 
number  of  persons  exposed  to  that  risk. 


3.  Ionizing  radiations 

3.1  Nature  and  measurement  of  ionizing  radiations 

The  electromagnetic  or  particulate  radiation  interacts  with  matter  causing  physical 
disruption  of  the  neutral  atom  into  a  knocked-out  electron  and  the  residual  electron- 
deficient  atom,  the  two  constituting  an  ion  pair.  The  process  is  called  ionization.  X-rays, 
gamma  rays  and  beta  particles  produce  these  ion  pairs  in  sparse  paths  and  are  called  low 
let  (linear  energy  transfer)  radiations.  By  contrast  alpha  particles,  protons  and  heavy 
ions  produce  ion  pairs  in  closely  spaced  paths  and  are  called  high  let  radiations.  The 
measurement  of  ion  pairs  produced  in  air  is  the  first  step  in  the  measurement  of 
radiation.  The  roentgen  ( R )  is  that  amount  of  radiation  which  produces  1  esu  of  charge, 
positive  or  negative,  in  1  c.c.  of  dry  air  at  ntp.  In  the  new  international  system  of  units 
(si)  radiation  exposure  in  air  is  measured  in  units  of  coulomb/kg.  The  production  of  ion 
pairs  causes  absorption  of  energy  leading  to  absorbed  radiation  dose.  The  deposition  of 
lOOergs/g  of  matter  constituted  1  rad  of  absorbed  dose.  The  si  system  has  replaced  rad 
by  Gray,  symbol  Gy,  and  equals  1  J/kg:  1  rad  =  0  0 1  Gy. 

The  specification  of  absorbed  dose  is  a  physical  event.  The  differences  in  the  let  of 
different  radiations  causes  differences  in  microscopically  distributed  energy  in  matter. 
To  account  for  these  differences,  which  ultimately  lead  to  differences  in  biological 
effects  an  LET-dependent  multiplier  factor  Q,  called  the  quality  factor,  is  used  to  obtain  a 
new  quantity  called  the  dose  equivalent  (de).  The  factor  Q  has  values  ranging  from  1  to 
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20.  The  de  is  measured  in  units  of  rem  or  its  si  equivalent  Sievert,  symbol  Sv;  1  rem 
=  0*01  Sv  (icru  1980;  Jain  and  Soman  1978). 

3.2  Radiation  effects  on  the  living  system 

Ionizing  radiations  are  known  to  affect  living  cells  by  killing,  interfering  with  their 
normal  functioning  like  delay  in  cell  division,  inducing  chromosomal  aberrations  and 
gene  mutations,  etc.  However,  the  living  system  is  also  capable  of  self-repair  of  at  least 
some  types  of  insults  to  it.  Therefore,  damages  to  the  living  organism  are  intervened  by 
the  repair  mechanisms  particularly  at  low  doses  of  low  let  radiations.  The  high  cellular 
sensitivity  to  radiation  (radiosensitivity)  is  governed  by  certain  characteristics  of  the  cell 
system  like  (i)  primitiveness  (ii)  proliferative  activity  (iii)  differentiation  (iv)  number 
of  divisions  required  to  attain  specialisation/maturity  etc.  Lymphocyte  is  an  exception 
to  these  general  observations.  Table  1  lists  cellular  sensitivity  in  the  decreasing  order. 
However,  potential  radiation-induced  cancer  risk  is  not  necessarily  correlated  with 
rapid  cell  division  in  tissues  highlighting  perhaps  the  complexity  of  cancer  etiology. 

3.3  Acute  effects  of  radiation 

Information  on  the  effects  of  radiation  has  come  either  from  laboratory  experiments  on 
animals  or  persons  exposed  accidentally  to  high  doses  of  radiation.  In  vitro  studies  of 
single  cell  preparations  show  that  cell  survival  and  reproductive  capacity  can  be 
diminished  by  relatively  high  radiation  doses.  Uniform  irradiation  of  the  human  body 
leads  to  changes  in  the  blood  count  picture,  the  extent  of  which  depends  on  the  dose. 
The  lymphocyte  being  the  most  sensitive,  its  numbers  decrease  rapidly  and  remain  so 
for  long  periods.  Hemoglobin  being  the  least  sensitive  is  affected  least  and  decreases 
slowly.  Other  components  of  the  blood  have  a  response  which  is  between  these 
two  extremes  and  is  shown  in  figure  3  for  neutrophils  and  figure  4  for  platelets 
(Paretzke  et  al  1982).  Table  2  gives  the  acute  effects  of  whole  body  irradiation  at  differ¬ 
ent  doses.  Large  total  body  doses  impair  most  severely  regenerative  cells  of  the  bone 
marrow,  proliferative  cells  of  the  intestinal  epithelium  and  the  endothelial  cells  of  the 
microvascular  system.  As  a  consequence  a  number  of  effects  evolve  in  which  phagocytic 
function  is  impaired,  electrolyte  balance  is  not  maintained  and  gastrointestinal  integrity 
is  affected.  The  result  is  loss  of  defenses  to  infection  and  spread  of  internal  infections. 


Table  1.  Cell  sensitivity  to  radiations  in  the  decreas¬ 
ing  order. 


1 .  Lymphyocytes 

2.  White  blood  cells  and  red  bone  marrow 

3.  Intestinal  crypt  cells 

4.  Cells  of  the  reproductive  organs  (gonads) 

5.  Liver  cells 

6.  Cells  of  glands 

7.  Cells  of  the  connective  tissue 

8.  Kidney  cells 

9.  Muscle  cells 

10.  Nerve  cells 
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Figure  3.  Acute  effect  of  irradiation  on  neutrophils  (Source:  Paretzke  et  al  1982). 


TIME  AFTER  EXPOSURE , DAYS 

Figure  4.  Acute  effect  of  irradiation  on  platelets  (Source  as  in  figure  3). 
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Table  2.  Effects  of  acute  whole-body  radiation  doses  (based  on  Glasstone  and  Sesonske 
1963) . 


Acute  dose,  rems 

Probable  clinical  effect 

0  to  50* 

No  observable  effects. 

50*  to  100 

Slight  blood  changes  but  no  other  observable  effects. 

100  to  200 

Vomiting  in  5  to  50%  of  cases  within  3  hours,  with  fatigue  and  loss  of  appetite. 
Moderate  blood  changes.  Recovery  will  occur  in  all  cases  within  a  few  weeks. 

200  to  600 

For  doses  of  300  rems  and  more,  all  exposed  individuals  will  exhibit  vomiting 
within  2  hours  or  less.  Severe  blood  changes  accompanied  by  hemorrhage  and 
infection.  Loss  of  hair  after  2  weeks  for  doses  over  300  rems.  Recovery  in  20  to 
100%  within  1  month  to  a  year. 

600  to  1000 

Vomiting  within  1  hour,  severe  blood  changes,  hemorrhage,  infection,  and  loss 
of  hair.  From  80  to  100  %  of  exposed  individuals  will  succumb  within  2  months; 
those  who  survive  will  be  convalescent  over  a  long  period 

*  This  number  is  sometimes  given  as  25.  However,  there  is  a  recent  tendency  to  believe  that  observable 
effects  of  radiation  do  not  begin  until  about  50rem. 


Three  different  radiation  syndromes,  following  acute  exposure,  have  been  noted.  The 
first  of  these,  occurring  at  2  to  4  Gy,  is  the  bone  marrow  syndrome  in  which  due  to  heavy 
killing  of  blood  cells,  particularly  stem  cells  of  the  bone  marrow,  there  is  very  high 
susceptibility  to  infection  and  loss  of  nourishment.  At  somewhat  higher  doses,  4-6  Gy, 
the  gastro-intestinal  tract  is  severely  affected  leading  to  internal  hemorrhage  and 
infections.  The  central  nervous  system  syndrome  follows  at  very  high  doses  of  10  Gy  or 
more. 

3.4  Low  level  radiation  effects  and  the  latent  period 

Low  levels  of  radiation  exposure  do  not  cause  any  immediately  observable  effect. 
Protraction  of  dose  over  a  long  period  of  time  causes  much  less  severe  effects  than  if  the 
same  dose  were  delivered  acutely.  Several  animal  studies  have  shown  these  differences 
in  radiation  effects.  To  cite  one  example  a  recent  study  on  the  effects  of  strontium  90,  a 
low  let  beta  particle  emitting  radionuclide  that  concentrates  in  the  skeleton  shows  that 
bone  cancer  is  induced  only  when  the  radiation  dose  rates  exceeded  about  1  rad  per  day 
for  several  years  i.e.  after  several  hundred  rems  of  dose  (Raabe  et  al  1981).  The 
American  National  Council  on  Radiation  Protection  and  Measurements  collected 
tumourigenesis  data  on  some  12  different  studies  on  laboratory  animals  covering  doses 
upto  200  or  300  rem  (ncrp  1980).  They  all  show  that  low-dose  rate  radiation  is  about 
two  to  ten  times  less  carcinogenic  for  the  same  doses  compared  with  radiation  delivered 
at  high  rates. 

There  is  considerable  incidence  of  natural  cancer  which  in  general  increases  with  age. 
Each  incremental  dose  of  radiation  causes  an  increase  in  this  rate  but  with  a  time  lag 
following  irradiation  in  which  no  medical  evidence  is  seen.  This  time  lag  is  referred  to  as 
the  latent  period  as  explained  in  figure  5  (nas  1980).  For  most  solid  tumors  this  is  of  the 
order  of  10-15  years  and  for  leukemia  it  is  about  3-5  years.  The  excess  rate  continues 
for  a  further  period  of  about  25  years  for  leukemia  and  lifetime  for  solid  cancers.  This  is 
known  as  the  period  of  expression  (nas  1980).  Thus  the  principal,  perhaps  the  sole, 
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Figure  5.  Radiation  cancer  incidence  spontaneous  and  following  exposure  and  latent  period 
(1)  (Source  nas  1980). 


radiation  effect,  at  low-dose  ranges,  in  the  exposed  population  is  the  somatic  risk  of 
increase  in  probability  of  cancer  induction.  It  is  important  that  the  evaluation  of  this 
increase  in  probability  of  cancer  induction  i.e.  risk  be  based  on  human  experience  and 
not  on  animal  studies. 


4.  Somatic  effects:  Cancer  induction 

The  most  important  somatic  effect  of  concern  for  radiation  protection  purposes  is  the 
induction  of  cancer  in  different  tissues.  Table  3  gives  the  sensitivity  to  natural  and 
radiation-induced  cancer  of  various  sites.  To  determine  this  a  considerable  number  of 
epidemiological  surveys  have  been  made  of  human  populations  who  have  been  exposed 
to  known  absorbed  doses  and  have  been  followed  with  adequate  ascertainment  of 
cancer  incidence  or  mortality,  for  long  periods  that  are  needed  to  estimate  the 
frequency  of  cancer  induced  by  radiation  (unscear  1977).  Reliable  information  in  this 
connection  has  come  from  three  main  types  of  exposures  (i)  radiological  procedures, 
therapeutic  (for  benign  conditions)  and  diagnostic,  (ii)  Japanese  survivors  of  atom 
bomb  attack  and  people  of  Marshall  Islands  affected  by  fallout  from  weapon  testing, 
(iii)  occupational-exposure  in  uranium  and  other  hard  rock  miners  and  in  radium  dial 
painters. 

4.1  Exposure  in  radiotherapy  and  diagnostic  radiology 

Ankylosing  spondylitis  has  been  treated  with  x-rays  as  well  as  intravenous  injection  of 
radium  224.  The  former  led  to  enhanced  mortality  from  leukemia  related  to  bone 
marrow  dose  and  the  latter  to  an  increased  incidence  of  bone  sarcoma  (Court  Brown 
and  Doll  1965;  Spiess  and  Mays  1973).  Certain  other  organs  such  as  stomach,  lung 
lymphoid  tissues  and  intestine  exposed  within  the  radiation  fields  also  showed 
increased  mortality  due  to  cancer  related  to  the  mean  absorbed  dose  in  these  organs 
(Smith  and  Doll  1978).  Similarly  other  radiotherapeutic  treatments  have  yielded 
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Table  3.  Organ  sensitivity  to  radiation-induced  cancer 


Site  of 

Cancer 

Natural 

incidence 

Radiation- 

induced 

Female 

breast 

Very  high 

High 

Leukemia 

Moderate 

Very  high 

Thyroid 

Low 

Very  high 

Lung 

Very  high 

Moderate 

Stomach  and 
colon 

High 

Moderate 

estimates  of  cancer  induction  in  breast,  thyroid,  brain,  skin,  etc  (Shore  et  al  1977;  Baral 
et  al  1977;  Hempelmann  et  al  1975;  Modan  et  al  1974;  Shore  et  al  1976).  Treatment  of 
benign  diseases  of  the  uterus  by  irradiation  has  provided  risk  estimates  of  pelvic  organs 
(Smith  and  Doll  1976).  Patients  treated  for  thyroid  cancer  by  radioiodine  have  given 
indications  of  leukemia  incidence  (Pochin  1 969). 

Diagnostic  exposures  being  much  lower,  only  in  three  situations  the  data  has  yielded 
information  on  cancer  induction:  firstly  breast  cancer  increase  in  female  patients 
who  underwent  repeated  fluoroscopy  during  lung  collapse  therapy  for  tuberculosis 
(Boice  et  al  1979),  secondly  incidence  of  leukemia  and  liver  cancer  in  patients  who  were 
given  thorotrast  as  a  contrast  medium  (unscear  1977);  and  thirdly  increase  in  leukemia 
and  other  malignancies  when  the  foetus  got  exposed  during  pelvic  or  abdominal 
radiographic  examination  of  the  mother  (Stewart  and  Kneale  1970). 

4.2  Japanese  survivors  and  Marshall  islanders 

The  exposed  surviving  populations  from  the  cities  of  Hiroshima  and  Nagasaki  (victims 
of  atom  bomb  attacks)  have  shown  enhanced  frequencies  of  leukemia  and  at  least  nine 
other  forms  of  malignancy  namely  the  lung,  female  breast,  thyroid  and  salivary  glands, 
lymphoid  tissues,  bladder  and  urinary  organs,  aesophagus,  stomach  and  large  intestine 
(Beebe  et  al  1977).  Although  dose  estimates  are  being  revised  (Lowe  1982)  which  may 
change  the  earlier  derived  risk  factors,  the  fact  remains  that  these  studies  have  provided 
extensive  human  experience.  Fallout  from  a  weapon  test  on  Bikini  in  1954  resulted  in 
ingestion  of  radio-iodine  by  some  Marshall  islanders.  These  have  provided  risk 
estimates  for  cancer  of  the  thyroid  (Conrad  et  al  1975). 

4.3  Occupation  exposures 

Estimates  of  lung  cancer  induction  by  alpha  radiation  from  radon  daughter  products  to 
the  bronchial  epithelium  among  uranium  and  other  hard  rock  miners  have  given 
relatively  consistent  risk  estimates  for  the  lung  (unscear  1977).  Uncertainties  in 
radiobiological  effectiveness  of  this  radiation  and  the  carcinogenic  effect  of  smoking 
complicate  the  general  applicability  of  these  estimates.  Definite  evidence  and  corre¬ 
lation  of  bone  cancer  have  been  obtained  from  radium  dial  painters  and  from  people 
who  were  administered  radium  for  its  supposedly  therapeutic  effect  (Rowland  et  al 
1971).  Radiologists  who  were  exposed  to  substantial  occupational  doses  during  the 
early  part  of  this  century,  in  spite  of  uncertainty  of  dose  received,  provide  assessments 
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of  excess  over  expected  different  fatal  malignancies.  From  this  relative  frequency  of 
leukemia  as  compared  with  all  other  malignancies,  for  more  or  less  uniform  irradiation 
of  the  whole  body,  can  be  obtained  (Matanoski  et  al  1975). 


5.  Somatic  risk  estimation 

5.1  Dose-response  models 

We  have  seen  above  that  the  cancer  incidence  data  available  is  at  high  doses  and  high 
dose  rates.  The  need,  however,  is  to  estimate  the  risk  of  cancer  induction  at  very  low 
doses  and  dose  rates.  Therefore,  it  becomes  necessary  to  construct  dose  response 
models  to  fit  the  available  data  and  extrapolate  it  to  low  doses.  It  is  this 
modelling  and  extrapolation  which  has  been  a  matter  of  controversy.  In  recent 
years,  a  general  hypothesis  for  estimation  of  excess  cancer  risk  in  irradiated  human 
populations,  based  on  theoretical  considerations,  on  extensive  laboratory  animal 
studies,  and  on  limited  epidemiological  surveys  has  been  developed  which  takes  the 
complex  quadratic  form  (nas  1980): 

7(D)  =  (a0  -f  ajD  +  a2D2)  exp  (-^D  -  j52Z)2), 

where  /  is  the  cancer  incidence  in  the  exposed  population  at  dose  D(cGy). 

This  multicomponent  dose-response  curve  contains  initial  upward  curving  linear 
(aj  and  quadratic  (a2)  functions  of  dose,  which  represent  the  process  of  cancer- 
induction  by  radiation,  and  a  modifying  exponential  function  of  dose  which  represents 
the  competing  effect  of  cell-killing  at  high  doses  (Px  and  /?2).  The  constant  a0  accounts 
for  the  natural  incidence  of  cancer  in  the  population.  Attempts  to  fit  the  available  data 
have  demonstrated  that  for  different  radiation-induced  cancers  only  certain  of  the 
parameter  values  of  these  constants  can  be  theoretically  determined.  Therefore,  simpler 
models  for  the  dose-response  relationship  in  the  low-level  dose  range  are  advocated  as 
shown  in  figure  6,  namely, 

(i)  linear  1(D)  =  a0  +  ctlD, 

(ii)  quadratic  1(D)  =  0L0  +  a2D2, 

(iii)  linear  quadratic  /(Z))  =  a0  +  a1D  +  a2D2. 

5.2  Risk  projection  models 

The  US  National  Academy  of  Sciences  (1980),  beir-iii  report  and  the  United  Nations 
Scientific  Committee  on  Effects  of  Atomic  Radiation  (1977)  report  use  two  models:  the 
absolute  risk  model  and  the  relative  risk  model.  The  absolute  risk  is  the  expression  of 
excess  cancer  risk  due  to  radiation  exposure  as  the  arithmetic  difference  between  the 
risk  among  those  exposed  and  that  occurring  in  the  absence  of  exposure  (figure  7).  This 
model  considers  the  latent  period  and  the  period  of  expression.  The  absolute  lifetime 
risk  co-efficient  is  expressed  as  the  total  number  of  excess  cancer  cases  in  the  exposed 
population  per  unit  collective  dose.  The  relative  risk  is  the  expression  of  cancer  risk  due 
to  exposure  as  the  ratio  of  the  risk  among  the  exposed  population  to  that  occurring  in 
the  absence  of  exposures  (figure  7).  Therefore,  the  excess  risk  is  a  multiple  of  the 
spontaneous  age-specific  cancer  rate  in  that  cohort  population.  The  greater  the  natural 
rate  of  cancer  incidence  in  a  population  (e.g.  ageing  population)  the  greater  will  be  the 
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Figure  6.  Radiation  risk  models  (source  Goldman  1982). 


Lifetime  Expression,  Comparison  of 
Absolute  and  Relative  Risk  Models 


Limited  Expression  Time,  Relative  Risk  Model, 
Comparison  of  Two  Age  at  Exposure  Groups 


Figure  7.  Models  of  absolute  and  relative  risk  and  age  dependence.  Xe  is  age  at  exposure  and 
1  is  the  minimum  latent  period  (Source  nas  1980). 
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Table  4.  Risk  estimates  for  individual  organs — cancer 
risk.  Fatalities  per  rem  per  106  persons. 


Organ 

UNSCEAR 

1977 

BEIR  III 

1980 

icrp  26 

1977 

Blood 

(leukemia) 

15-25 

22 

20 

Breast 

50 

87 

25 

Lung 

25-50 

42 

20 

Thyroid 

10 

2-2  M 

5 

5-8  F 

Bone 

25 

— 

5 

Stomach 

10-15 

15  M 

10 

17  F 

Liver 

10-15 

8 

10 

Lower  large 

intestine 

10-15 

— 

10 

Gonads 

40 

susceptibility  of  the  individuals  comprising  that  population  to  radiation-induced 
cancer.  Since  epidemiological  studies  on  human  population  are  not  complete  as  yet  (not 
followed  upto  death)  there  can  be  wide  differences  in  risk  estimation  by  the  two 
methods  at  any  one  period  of  follow-up.  When  the  study  is  complete  so  that  no  more 
cancers  occur  in  the  studied  population,  both  models  should  lead  to  the  same  numerical 
estimates  of  lifetime  cancers  though  the  risk  distribution  may  be  different  in  the 
exposed  population.  The  beir-iii  report  emphasised  that  some  experimental  and  human 
data,  as  well  as  theoretical  considerations,  suggest  that  for  exposure  to  low  doses  of  low 
let  radiation  such  as  x-rays  and  gamma  rays,  the  linear  model  probably  overestimates 
but  can  be  used  to  define  the  upper  limits  of  risk  of  most  radiation-induced  cancers  in 
man.  Similarly  the  pure  quadratic  model  may  be  used  to  define  the  lower  limits  of  risk 
from  low  dose,  Iow-let  radiation.  For  exposures  to  high-LET  radiation  (neutron,  alpha 
particles,  etc)  linear  risk  estimates  are  less  likely  to  overestimate  the  risk  and  may,  in  fact, 
underestimate.  The  International  Commission  on  Radiological  Protection  has  used  the 
linear  extrapolation  to  estimate  risks.  Table  4  gives  the  risk  estimates  for  individual 
organs  by  different  bodies. 


6.  Genetic  effects 

The  induction  of  genetic  mutations  by  radiations  has  been  known  now  for  more  than 
five  decades.  A  large  amount  of  information  is  available  on  the  frequency  with  which 
certain  hereditarily  transmitted  abnormalities  are  caused  in  the  mouse,  as  well  as  in 
various  vegetable  and  other  animal  forms,  by  irradiation  even  at  low  doses  (unscear 
1977;  Oftedal  and  Searle  1980).  In  man,  however  no  corresponding  information  has 
been  obtainable  on  the  rate  of  induction  of  harmful  genetic  effects  by  radiation.  But  the 
frequency  of  chromosomal  aberrations  is  known  in  human  blood  lymphocytes  cultured 
after  exposure  of  the  blood  ‘in  vitro’  or  after  irradiation  received  in  vivo  in  the  course  of 
medical  or  other  procedures  at  known  dose.  But  these  cannot  form  a  basis  for  genetic 
-risk  estimation.  The  latter  estimates  must  at  present  be  based  largely  upon  mice  data  of 
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radiation  exposure  required  to  double  the  normal  frequency,  the  doubling  dose.  This 
doubling  dose  for  twelve  different  types  of  hereditary  abnormality  in  male  or  female 
mice  have  ranged  from  04  to  2  6  with  a  mean  of  14  (±0*2,  S.E)Gy  (Searle  1977). 

Genetic  risk  estimates  have  been  obtained  assuming  that  a  comparable  doubling 
dose,  taken  as  1  Gy  for  Iow-let  low-dose  rate  radiation,  is  a  characteristic  of  genetic 
damage  in  mammals  and  may  also  apply  to  man  (unscear  1977).  Knowing  the  normal 
frequency  of  all  genetic  abnormalities  that  are  maintained  by  mutation,  a  risk  estimate 
of  major  genetic  harm  expressed  in  the  first  generation  has  been  derived  as 
0*9  x  10"2Gy_1  with  0*35  x  10" 2  Gy-1  in  the  second,  and  a  total  of  3*2  x  10" 2  Gy-1 
in  all  generations  (Oftedal  and  Searle  1980).  unscear  (1977)  estimated  0*65 
x  10" 2  Gy  - 1  for  first  generation  and  a  total  of  1*85  x  10“ 2  Gy' 1  for  all  generations. 
icrp  (1977a)  has  used  a  value  of  10~2Gy_1  for  the  first  two  generations  and  2 
x  10" 2  Gy-1  for  all  generations. 


7.  Risk  limitation  through  system  of  dose  limits  in  radiation  protection 

It  is  believed  that  like  genetic  mutations,  for  cancer  induction  also  the  initiating 
processes  are  chromosomal,  since  the  abnormal  reproductive  behaviour  of  transformed 
cells  must  be  transmitted  in  subsequent  cell  divisions  (Pochin  1980;  Rossi  and  Kellerer 
1972).  The  dose-effect  relationship  is  thus  likely  to  have  the  same  general  form  as  for 
genetic  effects.  Cancer  induction  and  genetic  effects  are  therefore  classed  as  stochastic 
effects;  where  there  is  no  threshold  and  there  is  a  probability  of  induction  at  all  doses 
however  low.  Non-stochastic  effects  on  the  other  hand  occur  if  their  threshold  is 
exceeded  and  their  severity  depends  on  the  amount  of  dose. 

Non-stochastic  effects  are  prevented  by  ensuring  that  no  tissue  receives  annual  doses 
which  reach  threshold  values  after  a  lifetime,  icrp  (1977a)  has  recommended  that  no 
tissue  shall  receive  more  than  0  05  Sv  annually  under  conditions  of  occupational 
exposure  and  0*5  Sv  per  year  in  members  of  the  general  public. 

The  stochastic  effects  are  not  preventable  but  their  frequency  of  occurrence  can  be 
limited.  It  is  important,  therefore,  that  unnecessary  exposure  shall  be  avoided  and 
unavoidable  exposure  shall  be  as  low  as  reasonably  achievable  (alara).  The  system  of 
dose  limits  that  is  adopted,  therefore,  indicate  maximum  values  which  should  never 
ordinarily  be  exceeded.  The  radiation  safety  of  an  occupation  depends  on  the  risks 
corresponding  to  the  average  annual  exposure  received.  In  order  to  compare  the  safety 
of  occupations  involving  radiation  exposures  with  those  not  involving,  accounts  should 
be  ideally  taken  of  the  whole  range  of  occupational  injuries,  diseases,  fatalities,  the 
mutagenic  or  carcinogenic  exposures  if  any,  and  a  variety  of  transient  or  permanent 
disabilities,  etc  (icrp  1977b).  Radiation  exposures  which  amount  to  an  average  annual 
value  of  5  mSv  may  cause  a  component  of  harm  equal  to  about  3-4  accidental  deaths 
annually  per  100,000  workers.  If  on  the  other  hand  the  working  conditions  are  such  that 
the  annual  dose  limit  of  50  mSv  was  commonly  approached  each  year  by  most  workers 
leading  to  an  average  dose  of  30mSv  per  year,  then  the  radiation  harm  would  be 
equivalent  to  about  20  accidental  deaths  annually  per  100,000  workers  (Pochin  1980). 

A  coherent  system  of  dose  limits  must  have  knowledge  of  and  suitably  equate  risks  of 
irradiating  individual  body  organs,  relative  to  that  of  uniform  whole  body  exposure  to 
ensure  compliance  with  the  limits.  To  achieve  this  the  icrp  (1977a)  has  used  weighting 
factors  based  upon  the  risks  of  fatal  cancers  of  different  organs  or  genetic  risks.  Thus. 


Radiation  protection  and  risk  evaluation 


291 


radiation  protection  has  come  a  long  way  from  the  earlier  empirical  though  safe,  dose 
limits  to  the  recent  more  scientifically  derived  sound  basis.  A  lot  more  data  and 
quantitative  information  on  several  aspects  of  radiation  effects  are  required  to  make 
radiation  protection  still  more  authoritative. 
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Electron  energy  loss  spectroscopy  (eels)  of  organic  molecules  in 
vapour  phase:  Design  and  fabrication  of  an  indigenous  eel 
spectrometer* 

M  S  HEGDE,  V  JAYARAM,  P  VISHNU  KAMATH  and 
CNR  RAOt 

Solid  State  and  Structural  Chemistry  Unit,  Indian  Institute  of  Science,  Bangalore  560  012, 
India 

Abstract.  An  indigenous  electron  energy  loss  spectrometer  has  been  designed  and  fabricated 
for  the  study  of  free  molecules.  The  spectrometer  enables  the  recording  of  low-resolution 
electronic  spectra  of  molecules  in  the  vapour  phase  with  ready  access  to  the  vacuum  ultraviolet 
region.  Electron  energy  loss  spectra  of  aliphatic  alcohols  and  carbonyl  compounds  as  well  as  of 
benzene  derivatives  have  been  recorded  with  the  indigenous  spectrometer  and  the  electronic 
transitions  in  these  molecules  discussed. 

Keywords.  Electron  energy  loss  spectroscopy;  vacuum  ultraviolet  spectroscopy;  eels. 

PACS  No.  33  20;  34  80 

1,  Introduction 

Electron  energy  loss  spectroscopy  (eels)  provides  an  effective  means  of  investigating 
electron  states  of  free  atoms  and  molecules  (Celotta  and  Huebner  1979;  Rao  et  al  1980). 
In  this  technique,  a  primary  electron  beam  of  energy  Eh  collides  with  atoms  or 
molecules  and  the  inelastic  collisions  cause  excitations  within  the  sample.  The  kinetic 
energy  Ef  of  the  inelastically  scattered  electrons  is  measured  and  the  difference, 
A  E  =  Et  —  Ef,  directly  gives  the  energy  absorbed  by  the  molecule  during  the  collision 
process  (figure  1).  eels  is  similar  to  optical  absorption  spectroscopy,  but  the  advantage 
of  eels  lies  in  the  fact  that  it  can  be  utilised  to  study  optically  forbidden  transitions 
(Kuppermann  et  al  1979;  Koerting  et  al  1984).  The  phenomenon  of  electron  exchange 
(Oppenheimer  1928)  whereby  the  scattered  electron  is  different  from  the  incident  one, 
enables  us  to  study  spin-forbidden  transitions.  Symmetry-forbidden  transitions  in 
molecules  can  also  be  observed  since  the  incident  electron  can  polarize  the  electron 
cloud  distribution  of  the  molecule  during  collision. 

What  is  specially  significant  about  eels  is  that  it  gives  us  ready  access  to  the  vacuum 
uv  region  of  electronic  spectroscopy  and  this  in  conjunction  with  photoelectron 
spectroscopy  provides  the  complete  molecular  energy  level  diagram  of  the  molecule 
under  study.  Since  an  eels  instrument  is  relatively  simple  and  inexpensive  to  fabricate, 
it  can  be  regarded  as  a  laboratory  vacuum  uv  spectrometer  of  low  resolution  for  studies 
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Figure  1.  Schematic  illustration  of  the  electron  energy  loss  technique  for  electronic 
excitations. 


involving  identification  and  characterization  of  molecules  in  the  vapour  phase,  eels  has 
the  disadvantage  of  low  resolution,  a  feature  common  to  all  techniques  of  electron 
spectroscopy.  In  this  paper  we  report  the  design  and  fabrication  of  a  simple  eel 
spectrometer  devoted  to  the  study  of  atoms  and  molecules  in  the  gas  or  vapour  phase. 
We  discuss  the  eel  spectra  of  several  organic  molecules  recorded  with  this  indigenous 
spectrometer,  having  calibrated  the  spectrometer  with  gases  such  as  He,  02  and  N2. 


2.  Design  of  the  spectrometer 

The  spectrometer  consists  of  an  electron  beam  source,  a  monochromator,  electron 
lenses,  sample  chamber,  analyser  and  a  detector.  As  most  of  the  electronic  excitations  in 
the  uv  and  vacuum  uv  regions  lie  in  the  energy  range  of  4-10  eV,  the  energy  of  a  typical 
source  should  be  about  three  to  five  times  this  energy.  We  have  therefore  selected  a 
source  energy  in  the  range  of  30-70  eV.  The  design  criteria  for  fabrication  of  the 
spectrometer  were  based  on  the  three  important  factors  discussed  below. 

2.1  Incident  electron  beam  intensity 

As  the  electron  beam  passes  through  each  of  the  components  of  the  spectrometer,  there 
is  a  certain  loss  in  intensity.  This  loss  is  minimized  by  the  use  of  appropriate  electron 
optics;  the  number  of  inelastically  scattered  electrons  would  also  be  small,  being 
typically  two  orders  of  magnitude  less  than  the  incident  beam.  In  view  of  these  factors,  it 
is  necessary  to  have  an  intense  elastic  peak  of  at  least  105  counts/sec  at  the  mouth  of  the 
detector.  This  would  correspond  to  a  current  of  10” 14 amps.  Assuming  a  loss  in 
intensity  of  the  order  of  104  in  the  monochromator,  103  in  the  sample  chamber  and  a 
further  loss  of  10  in  the  analyser,  the  total  loss  over  all  the  components  would  be  of  the 
order  of  108.  The  electron  gun  source  must  therefore  be  capable  of  emitting  in  the  range 
of  1-10  microamps  in  the  energy  range  30-70  eV.  The  electron  source  was  designed 
with  these  specifications  in  view. 


eel  spectrometer 


295 


2.2  Resolution 

The  desired  resolution  of  a  spectrometer  of  this  kind  meant  for  the  study  of  organic 
molecules  (operating  under  low  resolution),  is  approximately  400  meV  for  a  primary 
energy  of  about  50  eV.  Resolution  of  this  order  can  be  obtained  using  spherical  sector 
analysers  (Kuyatt  and  Simpson  1967).  The  resolution  A  E/E  of  spherical  sector 
analysers  is  given  by  the  formula  A  E/E  =  S/2R0  where  S  is  the  slit  width  and  R0  is  the 
mean  radius  of  the  analyser.  If  we  use  a  convenient  slit  width  of  1  mm,  an  analyser  of  a 
mean  diameter  of  125  mm  is  needed. 

2.3  Vacuum  conditions 

Since  the  spectrometer  is  devoted  to  the  study  of  atoms  or  molecules  in  the  gas  or  the 
vapour  phase,  it  is  clear  that  two  separate  and  independent  pumping  systems  are 
required.  One  provides  the  base  vacuum  required  for  monochromator  and  the 
spectrometer  chambers,  while  the  other  provides  differential  pumping  for  the  stream  of 
sample  molecules.  The  sample  pressure  required  to  record  an  eel  spectrum  depends  on 
the  nature  of  the  sample,  as  well  as  the  length  of  the  sample  chamber.  The  sample 
pressure  must  be  such  that  the  mean  free  path  of  the  electrons  for  electron  impact 
excitation  must  be  less  than  the  length  of  the  collision  chamber.  The  mean  free  path  X  is 
defined  as  X  =  1  /no,  where  n  is  the  number  of  molecules  per  c.c.  and  o  is  the  inelastic 
electron  impact  collision  cross-section.  Assuming  a  to  be  in  the  range  10“ 1 5-10~ 16  cm2 
(McDaniel  1964)  for  a  X  of  approximately  25  mm,  the  sample  pressure  required  is  of  the 
order  of  10“ 4— 10  “ 3  torr.  The  desired  base  vacuum  of  the  spectrometer  on  the  other 
hand  is  of  the  order  of  10" 6  torr.  Hence  the  crucial  need  would  be  for  an  effective 
differential  pumping  of  the  sample.  The  vacuum  system  was  designed  keeping  these 
requirements  in  view. 

3.  Fabrication  of  the  spectrometer 

3.1  Electron  source 

A  schematic  diagram  of  the  electron  gun  used  in  the  spectrometer  is  shown  in  figure  2.  A 
thoriated  tungsten  filamet  of  0T  mm  dia  is  mounted  on  a  pair  of  Ni  rods.  The  filament  is 


Figure  2.  Schematic  diagram  of  the  electron  gun  used  in  the  spectrometer.  OC:  outercasing; 
VF:  vacuum  feed  through;  R:  reflecting  electrode;  W:  tungstun  filament;  G;  grid;  CM: 
collimator;  MF:  main  flange. 


296 


M  S  Hegde  et  al 


resistively  heated  with  a  constant  current-stabilised  power  supply  fabricated  in  the 
laboratory.  The  electron  beam  current  is  controlled  by  using  a  negatively  biased  grid. 
The  gun  is  mounted  on  a  63  mm  diameter  brass  flange  and  enclosed  in  a  stainless  steel 
casing  as  shown  in  figure  2.  Electrons  with  energy  of  30-70  eV  were  obtained  in  this 
manner. 

When  the  filament  is  fitted  to  a  reflecting  electrode  R,  which  is  at  the  same  voltage  as 
the  filament,  an  emission  current  of  5-10  /iarnp  can  be  obtained. 

3.2  Monochromator  and  analyser 

The  monochromator  as  well  as  the  analyser  consist  of  two  hemispherical  sectors  of 
mean  diameter  125  mm.  The  hemispherical  sectors  have  been  machined  out  of  Al 
blocks  to  an  accuracy  of  +  0  025  mm.  The  hemispherical  sectors  are  gold-coated  by 
vacuum  deposition  employing  planetary  rotation,  to  ensure  uniform  coating  of  the 
spherical  surfaces.  The  sectors  are  then  mounted  on  a  6  mm  thick  aluminium  support 
plate  with  suitable  electrical  insulation  (figure  3);  the  electrical  insulations  are  also 
shown  in  the  figure. 


Figure  3.  Schematic  illustration  of  the  analyser  assembly.  Shaded  areas  are  machined  out  of 
teflon.  IS:  inner  sphere;  OS:  outer  sphere;  SP:  support  plate;  MF:  main  flange.  Inset  shows  the 
details  of  insulation  employed  in  the  assembly  of  the  analyser. 


The  performance  of  the  spherical  sector  analysers  was  measured  over  a  wide  range  of 
electron  beam  energies  (25-100  eV)  of  interest  to  us.  The  analyser  response  was  found 
to  be  linear  over  the  30-70  eV  range.  Analysers  of  mean  diameter  1 14  mm  and  139  mm, 
have  also  been  machined  and  tested  for  their  linear  response.  The  analyser  performance 
was  found  to  improve  on  coating  the  slits  with  gold. 

3.3  Electrostatic  lenses 

An  electrostatic  lens  systems  was  machined  as  described  elsewhere  (Spangenberg  and 
Field  1943;  Lassettre  et  al  1963).  The  lens  assembly  (figure  4)  consists  of  a  pair  of 
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Figure  4.  Schematic  illustration  of  the  electrostatic  lens  assembly  employed  in  the 
spectrometer.  Tj  and  T2  are  cylindrical  tubes  comprising  the  lens  system.  The  shaded  area  is 
machined  out  of  teflon.  MS:  monochromatiser  slit;  J:  outer  jacket;  SC:  sample  chamber  slit. 


identical  hollow  cylindrical  tubes  mounted  co»axially  in  an  outer  jacket  and  insulated 
electrically  from  one  another,  the  tubes  being  separated  from  each  other  by  a  distance 
equal  to  one-tenth  of  their  diameter.  The  lens  system  mounted  on  the  “out”  slit  of  the 
monochromator,  acts  as  a  collimator  and  helps  to  focus  the  beam  emerging  from  the 
monochromator  into  the  sample  chamber.  The  cylindrical  lenses  were  machined  from 
stainless  steel  rods  of  suitable  dimensions  to  an  accuracy  of  ±0-025  mm  and  coated 
with  gold.  The  lens  system  was  found  to  work  with  negligible  chromatic  aberration. 

3.4  Sample  chamber 

The  sample  chamber  was  constructed  out  of  a  stainless  steel  bellow  25  mm  in  diameter 
and  25  mm  in  length.  It  was  fitted  between  two  brass  collars,  which  were  in  turn  welded 
to  the  sample  input  and  output  lines.  The  gaseous  sample  is  introduced  through  a 
sensitive  needle  valve.  When  the  sample  pressure  in  the  sample  chamber  was 
10“ 3— 1 0-4  torr,  the  base  vacuum  in  the  spectrometer  chambers  increases  from 
2  x  1 0  _  6  torr  to  4  x  1 0 ~  6  torr.  The  electron  beam  emerging  from  the  lens  system  meets 
the  stream  of  sample  molecules  at  right  angles  in  the  sample  chamber. 

The  flexibility  of  the  bellow  permits  us  to  collect  the  scattered  electron  current  along 
off  normal  angles.  A  study  of  the  angular  dependence  of  the  scattered  electron  current 
can  provide  information  on  the  shapes  and  symmetries  of  orbitals. 

3.5  Detector  and  ratemeter 

A  Mullard  channel  electron  multiplier  (B419BL)  was  used  to  amplify  the  electrons 
coming  from  the  energy  analyser;  this  is  the  only  imported  component  in  the 
spectrometer.  The  electron  multiplier  responds  to  an  input  of  one  electron  upto  about 
108  electrons  and  its  duration  (full-width  at  half-maximum)  is  about  10  nanosecond. 
The  maximum  operating  voltage  of  the  multiplier  is  around  3-5  kV,  the  nominal  gain  at 
2-5  kV  being  about  IT  x  108  at  a  nominal  resistance  of  5  x  109  ohm. 

The  high  impedence  transducer  (detector)  such  as  the  electron  multiplier  requires  an 
amplifier  which  converts  a  transfer  of  charge  into  a  change  of  voltage.  In  the  ratemeter 
(figure  5),  we  used  a  hybrid  charge-sensitive  preamplifier-discriminator  (amptek  A- 101 
pad)  to  convert  pulses  of  charge  to  pulses  of  voltage.  This  is  capable  of  handling  counts 
upto  a  maximum  of  4  x  106  with  pulse  pair  resolution  of  250  nsec.  The  input  threshold 
could  be  varied  from  1-6  x  10“  13  coulomb  to  15  x  10“ 12  coulomb. 

The  output  from  the  charge-sensitive  amplifier  is  fed  to  the  decad  counter  and 
monostable  multivibrator.  This  positive  pulse  is  fed  to  the  voltage  level  translator 
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Figure  5.  Block  diagram  for  the  construction  of  ratemeter. 


which  produces  the  constant  1 2  V  positive  pulse.  This  constant  height  pulse  is  fed  to  the 
‘frequency  to  voltage’  converter  and  buffer.  Reference  voltage  is  used  to  bias  the  voltage 
level  translator  and  also  to  introduce  zero  bias  to  ‘frequency-to-voltage’  converter.  The 
output  dc  voltage  from  the  ratemeter  is  fed  to  the  Y  axis  of  the  X-Y  recorder  which 
directly  gives  the  number  of  counts  per  second. 

3.6  General  assembly  and  the  vacuum  system 

The  spectrometer  consists  of  two  main  chambers  241  mm  in  diameter  and  228  mm  in 
length  fitted  to  brass  collars  of  304  mm  diameter.  The  main  chambers  are  made  by 
rolling  3  mm  stainless  steel  plates  into  cylinders  with  all  the  joints  being  accomplished 
by  silver  brazing.  Both  the  chambers  have  been  provided  by  two  more  sheets  of  /i-metal 
for  shielding  the  earth’s  magnetic  field. 

The  general  assembly  of  the  spectrometer  is  shown  in  figure  6  and  the  electronic 
control  in  figure  7.  The  source,  monochromator  and  the  electrostatic  lens  are  fitted  to  a 
300  mm  diameter  Al  flange  12  mm  in  thickness,  and  is  fitted  to  the  first  chamber.  The 
monochromator  chamber  is  mounted  on  a  1 50  mm  diameter  oil  diffusion  pump  which 
is  backed  by  a  300  lit/min  rotary  pump.  This  consists  of  the  main  pumping  system  and  is 
capable  of  providing  an  ultimate  vacuum  of  better  than  2  x  10" 6  torr,  with  a  total 
pump  down  time  of  less  than  30  min. 

The  analyser  and  detector  are  mounted  on  a  300  mm  diameter  Al  flange  of  12  mm 
thickness.  This  flange  is  fitted  to  the  analyser  vacuum  chamber.  The  analyser  chamber  is 
also  connected  to  the  main  vacuum  system  through  a  25  mm  diameter  flexible  tubing. 
The  collision  chamber  is  fitted  between  the  two  spectrometer  chambers  linking  the 
monochromator  output  to  the  analyser  input.  The  collision  chamber  is  fitted  to  a 
100  mm  oil  diffusion  pump  backed  by  a  300  lit/min  rotary  pump  for  differentially 
pumping  the  sample. 

Both  the  spectrometer  chambers  are  provided  with  pirani  and  hot  cathode  ionisation 
gauges  for  measurement  of  vacuum.  The  sample  pumping  line  is  also  fitted  with 
separate  gauge  heads  for  measuring  the  sample  pressure.  A  photograph  of  the 
instrument  in  final  assembly  is  shown  in  figure  8. 
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Figure  6.  General  assembly  of  the  spectrometer. 
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Figure  7.  Schematic  illustration  of  the  electronic  controls  employed  in  the  spectrometer. 


4.  Operation  of  the  spectrometer 

The  vacuum  system  is  first  switched  on.  On  achieving  the  base  vacuum  of  the  system 
(2xl0-6  torr),  the  source  gun  is  put  on  and  degassed  for  10  to  15  min.  The 
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Figure  8.  Photograph  of  the  working  spectrometer  in  final  assembly. 


collision  chamber  is  pumped  through  the  sample  pumping  line  till  the  base  vacuum  in 
the  sample  pumping  line  (4  x  10  ~ 6  torr)  is  of  the  same  order  as  that  in  the  main 
pumping  system.  The  source  energy  is  selected  to  be  in  the  50-70  eV  range  and  the 
monochromator  and  lens  voltages  are  suitably  altered  until  the  Faraday  collector  C  (see 
figure  6)  mounted  at  the  mouth  of  the  analyser  collects  a  maximum  beam  current 
(~  1 0 _  1 2—  1 0  9  amps).  This  beam  enters  the  analyser.  A  high  voltage  of  2-2-2-S  kV  is 
applied  to  the  multiplier  and  the  ratemeter  is  switched  on.  The  analyser  voltage  is 
scanned  using  a  dc  ramp.  The  spectra  are  recorded  by  scanning  the  voltage  across  the 
analyser  by  keeping  the  retard  voltage  at  zero.  We  can  also  scan  the  retard  voltage 
applied  to  the  analyser  mounting  plate  by  applying  a  constant  dc  voltage  to  the 
hemispheres.  The  elastic  peak  is  first  recorded  at  low  sensitivity  of  the  ratemeter  and 
having  found  the  elastic  peak,  the  sample  is  introduced  into  the  collision  chamber,  until 
the  sample  pressure  rises  to  10“ 1 0  3  torr.  The  elastic  peak  height  drops  to  half  its 
original  value  at  this  juncture  and  in  order  to  detect  the  loss  peaks,  the  ratemeter  is  then 
fixed  to  high  sensitivity  (typically  300  to  103  times  higher  than  the  sensitivity  used  to 
record  the  elastic  peak). 


5.  Performance  of  the  spectrometer 

The  spectrometer  is  readily  calibrated  by  recording  the  spectra  of  standard  gases  such 
as  He,  N2,  02  and  benzene  vapour.  The  spectrum  of  He  is  shown  in  figure  9a,  where 
peaks  due  to  excitations  into  the  various  excited  states  are  clearly  assigned.  The  peak 
due  to  the  1  lS  ->  2lS  excitation  can  also  be  assigned,  thus  demonstrating  the  versatility 
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of  the  spectrometer  in  detecting  optically  forbidden  transitions.  The  optically  allowed 
transitions  to  the  2 lP,  3*P  and  4 lP  excited  states  are  seen  as  prominent  peaks  and  are 
eminently  suited  for  calibration.  In  figure  9b,  we  show  the  spectra  of  pure  N2,  pure  02 
and  air.  The  spectrum  of  air  appears  as  due  to  a  mixture  of  N2  and  02.  The  electronic 
excitations  responsible  for  these  peaks  are  described  in  the  literature  (Lassettre  et  al 
1963;  Kuppermann  et  al  1968). 

At  a  primary  beam  energy  of  50  eV,  the  fwhm  of  the  elastic  peak  is  360  meV.  The 
resolution  of  the  spectrometer  is  0-72  %  at  50  eV  and  the  uncertainty  in  measurement  is 
±  0-05  eV. 
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Figure  9.  (a)  eel  spectrum  of  He  gas  recorded  in  the  spectrometer.  Inset  shows  the 

spectrum  on  a  expanded  scale,  together  with  the  assignment  of  the  transitions. 
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Figure  9.  (b)  eel  spectrum  of  N2,  02  and  air  recorded  for  the  purpose  of  calibration. 


6.  Study  of  eel  spectra  of  organic  molecules 

6.1  Aliphatic  alcohols  and  diethylether 

In  figure  10  we  show  the  eel  spectra  of  a  few  aliphatic  alcohols  and  diethylether.  The 
absorption  maxima  observed  in  these  compounds  are  listed  in  table  1.  These  molecules 
can  show  transitions  due  to  the  n  -*■  o*  and  o  ->  a*  excitations  (Rao  1975).  The  n  -+  a* 
transition  would  be  associated  with  lower  energies  and  intensities  compared  to  the 
a  — ►  o*  transitions  which  occur  in  the  vacuum  uv  region  with  high  extinction 
coefficients.  Accordingly  in  eel  spectra,  the  first  band  occurs  at  177  nm  (7  eV)  in 
CH3OH  which  compares  well  with  the  uv  absorption  band  of  174nm  reported  by 
Tsubomura  et  al  (1964).  This  band  is  due  to  the  n  -►  <7*  transition  involving  the  oxygen 
lone  pair.  The  first  band  in  CH3OH  is  quite  intense  and  sharp,  but  as  the  alkyl  group 
becomes  bulkier,  the  first  band  becomes  broader  and  less  intense.  The  broadening  of 
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Figure  10.  eel  spectra  of  aliphatic  alcohols. 


the  n  — ►  a*  band  is  expected  as  bulkier  alkyl  groups  provide  a  wide  spectrum  of  excited 
o *  states.  Secondary  and  tertiary  alcohols  exhibit  the  n  -*■  o*  band  at  lower  energies 
compared  to  the  primary  alcohols  due  to  enhanced  inductive  effect  of  the  alkyl  groups 
in  the  former. 

In  addition  to  the  n  ->  a*  band,  we  see  additional  bands  at  higher  energies  in  eel 
spectra  of  all  the  alcohols.  These  bands  are  broad  and  increase  in  complexity  as  the  alkyl 
groups  become  bulky.  These  bands  may  be  divided  into  two  categories.  The  first 
category  appears  in  the  158-151  nm  (7*8— 8-2  eV)  range  and  is  absent  in  CH3OH.  It 
appears  as  a  distinct  peak  in  C2H5OH  and  i-C3H7OH,  but  becomes  a  shoulder  in  the 
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Table  1 .  Absorption  maxima  in  the  vacuum  uv  region  for  some  aliphatic  alcohols 
and  halocarbons(a). 


Compound 

n  ->  a* 

(7-+<7*(b) 

CH3OH 

177(6-98) 

141(8-78) 

130(9-5) 

EtOH 

190(6-49) 

156(7-94) 

132(9-39) 

115(10-7) 

C2H5OC2H5 

187(6-62):  173(7-16) 

139(8-9) 

125(9-87) 

isopr-OH 

191(6*5) 

161(7-7) 

139(8-9) 

127(9-75)  114(10-82) 

npr-OH 

177(6-98) 

156(7-94) 

141(8-78) 

125(9-86) 

r-Bu-OH 

184(6-74) 

151(818) 

133(9-26) 

117(10-59) 

n-BuOH 

198(6-26) 

158(7-82) 

133(9-26) 

CH2C12 

171(7-22) 

151(818) 

139(8-90) 

CHCI3 

177(6-98) 

145(8-54) 

133(9-26) 

CC14 

174(7-09) 

141(8-78) 

133(9-26) 

(a)  Values  in  parentheses  are  transition  energies  in  eV. 

At  low  wavelengths,  some  Rydberg  transitions  may  be  mixed  with  the  a  -*  a*  transitions. 


higher  members  of  this  series.  This  feature  is  likely  to  be  due  to  the  <x  ->  a*  transition; 
since  it  is  absent  in  CH3OH,  we  may  assign  it  to  the  o  -►  cr*  transition  involving  the  C-C 
bond. 

The  second  category  of  o  -*■  <r*  transitions  occurs  at  wavelengths  below  1 50  nm 
(  >  8-2  eV).  There  is  a  band  appearing  in  the  140-130  nm  (8-8-9  5  eV)  range  and 
another  at  or  below  1 30  nm;  the  first  is  probably  due  to  the  o  -►  a*  transition  involving 
the  C-H  bond.  Below  1 30  nm  (  >  9-5  eV)  excitation  into  the  Rydberg  states  probably 
occurs. 

In  diethylether,  the  n  -+  o*  band  of  ethanol  splits  into  two  bands,  one  at  lower  and 
the  other  at  higher  energy  relative  to  ethanol. 

6.2  Carbonyl  compounds 

In  figure  1 1  we  show  the  eel  spectra  of  a  few  carbonyl  compounds.  The  carbonyl  group 
exhibits  n  ->n*,n  -+  o*  and  n-+n*  transitions  in  the  electronic  spectra  (Rao  1975);  the 
n  ->  n*  transition  has  a  low  extinction  coefficient  and  is  not  seen  in  the  eel  spectra 
recorded  by  us.  The  first  band  seen  in  eel  spectra  appears  in  the  energy  range 
180-190  nm  (6- 5-6*8  eV)  in  acetaldehyde  and  acetone  and  this  can  be  readily  assigned 
to  the  n  ->  o*  transition  of  the  carbonyl  group.  In  acetone,  it  appears  as  a  sharp  peak, 
while  in  acetaldehyde  it  is  a  shoulder  on  a  higher  energy  peak.  In  methylacetate  also  it 
appears  as  a  weak  shoulder.  This  band  is  however  not  seen  in  formaldehyde,  which 
shows  a  single  sharp  peak  at  172  nm  (7*2  eV)  due  to  the  n  n*  transition.  The 
transition  occurs  with  high  intensity  in  the  166-172  nm  (7  2-7*5  eV)  range  in  all  the 
carbonyl  compounds  studied  by  us  (see  table  2).  Below  155  nm  (8-0  eV),  carbonyl 
compounds  show  a  broad  band  with  a  multiplicity  of  peaks  due  to  the  a  -+  o* 
transitions. 

In  formic  and  acetic  acids,  the  peak  due  to  the  n  -►  n*  transition  occurs  at  153  nm 
(Rao  1975),  while  in  acetic  anhydride  it  is  around  166  nm  (7*5  eV)  just  as  in  other 
carbonyl  compounds.  Formic  acid  shows  no  other  peak,  but  both  acetic  acid  and  its 
anhydride  exhibit  a  broad  band  around  125  nm  (9-9  eV)  due  to  the  o  -*  o*  transitions 
of  the  methyl  group  (figure  1 2).  Presence  of  several  Rydberg  states  causes  broadening 
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Figure  11.  eel  spectra  of  aliphatic  carbonyl  compounds. 


of  the  o  -M7*  band  in  the  higher  energy  region.  Both  the  acids  show  a  slight  broadening 
of  the  n  ->  n*  peaks  possibly  due  to  dimerization. 

eel  spectra  of  a  series  of  aliphatic  amides  are  shown  in  figure  1 3.  The  band  due  to  the 
Ti  — ►  Ti*  transition  shifts  considerably  from  168  nm  (74  eV)  in  formamide  to  202  nm 
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Figure  12.  eel  spectra  of  formic  and  acetic  acids  together  with  the  spectrum  of  acetic 
anhydride. 


Table  2.  Absorption  maxima  seen  in  the  vacuum  uv  region  for  a  series  of 
carbonyl  compounds^. 


Compound 

n  ->  a* 

n  -*  n* 

a  —* 

HCHO 

— 

172(7-22) 

CH3CHO 

180(6-85) 

169(7-34) 

147(8-42) 

CH3COCH3 

192(6-44) 

— 

152(8-12) 

CH3COOCH3 

180(6-85) 

166(7-46) 

130(9-51) 

(CH3C0)20 

— 

166(7-46) 

— 

128(9-62) 

CH3COOH 

— 

153(8-1) 

— 

124(9-98) 

HCOOH 

— 

166(7-45);  153(8-1) 

— 

hconh2 

— 

168(7-34) 

— 

HCONH(Me) 

— 

177(6-98) 

— 

HCON(Me)2 

— 

202(6-13) 

— 

CH3CON(Me)2 

— 

194(6-37) 

— 

(a)  Values  in  parentheses  are  transition  energies  in  eV. 

(b)  Rydberg  transitions  may  be  mixed  up  with  the  a  -*  a*  transitions  at  low  wavelengths. 
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Figure  13.  eel  spectra  of  aliphatic  amides. 


(61  eV)  in  N-N  dimethylformamide.  Similar  shifts  are  seen  in  the  acetamide  series  as 
well  (table  2). 

6.3  Benzene  derivatives 

In  figures  1 4  and  1 5,  we  show  the  eel  spectra  of  benzene  and  several  of  its  mono-  and  di- 
substituted  derivatives.  Benzene  exhibits  three  bands  in  the  electronic  absorption 
spectra,  two  of  these  in  the  ultraviolet  region  (203  and  260  nm)  and  one  in  the  far 
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Figure  14.  eel  spectra  of  a  few  benzene  derivatives. 


Table  3.  Absorption  maxima  in  vacuum  uv  region  for 
some  aromatic  molecules 


0(nm) 

P(nm) 

a  (nm) 

c6h6 

184(6-74) 

Toluene 

181(6-86) 

p-xylene 

190(6-49) 

Chlorobenzene 

190(6-49) 

Bromobenzene 

184(6-74) 

Anisole 

189(6-55) 

214(5-77) 

271(4-57) 

Benzaldehyde 

198(6-25) 

234(5-29) 

271(4-57) 

Aniline 

190(6-49) 

224(5-53) 

278(4-45) 

N-Methyl  Aniline 

196(6-32) 

231(5-35) 

286(4-33) 

N-N-Dimethylaniline 

202(6-13) 

245(5-05) 

278(4-45) 

N-N-Diethylaniline 

198(6-25) 

257(4-81) 

294(4-21) 

p-Chlorotoluene 

190(6-49) 

224(5-53) 

p-Chlorophenol 

190(6-49) 

221(5-59) 

282(4-39) 

(a)  /?,  p  and  a  bands  correspond  to  the  183,  203  and  260  nm  bands  of 
benzene  respectively.  Values  in  parantheses  are  transition  energies  in  eV. 
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Figure  15.  eel  spectra  of  a  few  benzene  derivatives. 


ultraviolet  region  (180  nm).  The  last  of  these  is  the  most  intense  with  an  extinction 
coefficient  of  105  (Rao  1975).  The  high  symmetry  of  the  benzene  molecule  causes  the 
lower  energy  bands  to  have  low  intensities.  Consequently  in  eel  spectra  recorded  by  us, 
we  only  see  the  1 80  nm  (6-9  eV)  band  as  a  sharp  feature.  In  the  mono-  and  di-substituted 
derivatives,  the  D6n  symmetry  of  the  benzene  molecule  is  destroyed  and  we  begin  to  see 
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the  lower  energy  bands  as  well.  This  is  exemplified  in  benzaldehyde,  where  the  234  nm 
(5-3  eV)  band  has  comparable  intensity  with  the  180  nm  band.  Substituents  cause  a  red 
shift  of  the  180  nm  band  of  benzene  (table  3).  The  shift  is  specially  significant  in  aniline 
and  its  methylated  derivatives,  where  the  amino  group  can  donate  electrons  to  the 
aromatic  ring. 
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Neutron  diffraction  studies  of  transition  metal  nitrides 
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Abstract.  The  magnetic  structures  of  Fe4N  and  Mn4N  have  been  redetermined  using 
neutron  diffraction.  The  magnetic  form  factors,  obtained  from  polarised  neutron  data  have 
been  shown  to  be  different  for  the  face-centred  and  corner  atoms.  A  qualitative  explanation  of 
the  structures  of  Fe4N  and  Mn4N  has  been  provided  from  the  shapes  of  the  magnetic  form 
factors. 

Keywords.  Neutron  diffraction;  magnetic  form  factor;  nitrides 

PACSNo.  75  50 


1.  Introduction 

Transition  metal  nitrides  Fe4N  and  Mn4N  exhibit  interesting  magnetic  properties  with 
ferro  and  ferrimagnetic  ordering  respectively.  The  crystal  structure  has  space  group 
Pm3m  in  which  the  metal  atoms  form  a  fee  matrix  and  the  nitrogen  atom  occupies  the 
interstitial  body  centre  position  (figure  1).  The  corner  atom  (M has  twelve  metal 
atoms  as  its  nearest  neighbours  at  2*69  A  whereas  the  face  centre  atom  (M„)  has  two 
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Figure  1.  Unit  cell  of  transition  metal  nitride  M4N. 
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nearest  neighbour  nitrogen  atoms  at  1-9  A.  Because  of  this  difference  in  their 
environments,  the  corner  and  the  face  centre  atoms  show  different  magnetic  properties. 

These  compounds  have  been  extensively  studied  for  their  electronic  and  magnetic 
properties.  Following  the  pioneering  work  of  Guillaud  (1953)  and  Wiener  and  Berger 
(1955)  on  these  compounds,  Frazer  (1958)  undertook  a  neutron  diffraction  study  of 
Fe4N  and  concluded  that  it  is  ferromagnetic  with  magnetic  moments  of  2-73  pB  and 

1  -77  nB  for  the  corner  and  the  face  centre  atoms  respectively.  The  Mossbauer 
measurements  (Shirane  et  al  1 962)  show  that  the  hyperfine  fields  at  the  corner  and  face 
centre  atoms  are  approximately  proportional  to  their  magnetic  moments  of  3  pB  and 

2  pB  respectively  and  the  isomer  shifts  are  in  line  with  3 d1  4s  and  3 d  8  4s  configurations 
for  the  two  sites.  These  configurations  are  derived  from  the  assumption  that  the 
nitrogen  acts  as  an  electron  donor  to  the  face  centre  iron  atoms.  However,  the  electron 
diffraction  (Nagakura  1968)  yields  the  electronic  configuration  as  Fe^FeJ/3  )3N. 

Neutron  diffraction  studies  on  polycrystalline  Mn4N  (Takei  et  al  1960;  Takei  et  al 
1962)  show  that  it  has  a  ferrimagnetic  ordering  with  corner  atom  moment  of  3*85  fiB 
aligned  antiparallel  to  the  three  face  centre  moments  of  0-9  pB  at  77°K.  On  the  basis  of 
neutron  diffraction  and  neutron  polarisation  analysis  (Bouchard  1968;  Fruchart  et  al 
1979)  it  has  been  claimed  that  Mn4N  has  a  noncollinear  component  of  spin. 

Mn4N  has  been  subjected  to  a  variety  of  experimental  measurements  (Hisashi  et  al 
1967;  Mamoru  1966;  Fruchart  et  al  1979)  and  it  has  exhibited  anomalous  magnetic  and 
electronic  properties.  These  observations  have  led  to  the  band  model  calculations 
(Labbe  and  Jardin  1982;  Jardin  and  Labbe  1975, 1983)  based  on  a  strong  crystal  field  at 
the  Mnn  site  and  a  resonance  between  Mn  and  N  electronic  levels. 

Regarding  the  magnetic  structure  of  Fe4N  and  Mn4N,  the  existing  literature  appears 
to  show  an  incomplete  appreciation  of  the  importance  of  the  difference  in  the 
environments  of  the  two  species  of  metal  atoms  in  these  structures.  The  fact  that  the  two 
atoms  have  significantly  different  environments  leads  to  a  difference  in  their  magnetic 
moments.  It  should  immediately  follow  that  in  the  neutron  experiments  the  atoms  must 
also  have  different  magnetic  form  factors.  All  the  analysis  so  far  have  used  the  same 
magnetic  form  factors  for  the  two  sites,  which  leads  to  a  serious  flaw  in  the  conclusions 
reached. 

The  present  measurements  are  aimed  at  understanding  not  only  the  magnetic 
structures  but  the  magnetic  form  factors  in  these  compounds.  The  results  on  magnetic 
structures  agree  with  the  earlier  results  (Frazer  1958;  Takei  et  al  1960).  Interestingly, 
the  magnetic  form  factors  for  the  face  centre  Fe  and  Mn  atoms  in  Fe4N  and  Mn4N 
respectively  are  found  to  be  significantly  different  from  those  of  the  corner  atoms. 

s 

2.  Experimental 

2.1  Sample  preparation 

Fe4N  was  prepared  by  passing  dry  ammonia  over  4N  pure  iron  sponge,  kept  at  475°C, 
for  3  hr.  The  mixture  of  Fe4N  and  Fe3N  thus  formed,  was  heated  in  vacuum  at  450°C 
for  4  hr.  The  resulting  product  was  cubic  /  phase  of  Fe4N,  which  was  confirmed  by 
x-ray  analysis.  For  Mn4N,  nitrogen  gas  free  of  oxygen  was  passed  over  4N  pure 
manganese  powder  kept  at  600°C.  It  was  found  necessary  to  crush  the  resulting  product 
and  repeat  the  heat  treatment  a  couple  of  times  to  ensure  a  total  reaction.  X-ray  analysis 
confirmed  the  formation  of  single  phase  Mn4N. 
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2.2  Neutron  diffraction  measurements 

The  unpolarised  neutron  diffraction  measurements  were  conducted  at  the  triga  mark  ii 
reactor  pptn  Bandung,  with  neutron  wavelength  of  107  A.  Polarised  neutron 
diffraction  patterns  were  recorded  using  the  polarised  neutron  spectrometer  at  the 
cirus  reactor,  barc.  This  instrument  has  a  wavelength  of  0-92  A  and  the  polarisation 
and  flipping  efficiencies  of  97-5  %  and  99-9  %  respectively. 


3.  Results  and  discussion 

3.1  MnAN 

The  analysis  of  unpolarised  neutron  diffraction  profiles  shows  that  the  structure  is 
ferrimagnetic  with  magnetic  moments  of  3-50  nB  and  0*89  /iB  for  the  corner  and  the  face 
centre  atoms  respectively.  The  polarised  neutron  diffraction  patterns  are  given  in 
figure  2.  Unpolarised  neutron  data  is  also  shown  for  comparison.  The  depolarisation  of 
the  neutron  beam  in  the  sample  was  obtained  by  inserting  the  sample  before  the 
analyser  crystal  and  measuring  the  polarisation  of  the  neutron  beam.  From  the 
polarised  neutron  intensities,  the  magnetic  form  factor  for  the  face  centred  Mn  atom 
was  determined  using  Mn+  form  factor  (Watson  and  Freeman  1961)  for  the  corner 
atom.  Figure  3  shows  the  experimental  form  factor  for  Mn„  along  with  the  Mn+  and 
Mn++  (Watson  and  Freeman  1961)  form  factors.  The  form  factor  for  the  Mn„  is 
sharper  than  that  of  Mn,.  This  suggests  that  the  unpaired  electron  distribution  of  Mn„ 
is  much  more  spread  out  than  that  of  Mn,. 

This  is  in  agreement  with  the  band  model  (Jardin  and  Labbe  1975, 1983;  Labbe  and 
Jardin  1982)  for  the  transition  metal  nitrides  and  carbides.  The  model  considers  the 


Figure  2.  Polarised  and  unpolarised  neutron  diffraction  patterns  of  Mn4N. 
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Figure  3.  Magnetic  form  factor  of  face  centre  Mn  atom  in  Mn4N. 
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Figure  4.  Crystal  field  levels  of  face-centred  Mn  (d-orbitals)  and  N  (p-orbitals).  EF  is  Fermi 
energy  (Labbe  and  Jardin  1982). 
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crystal  field  effects  at  Mnn  site.  The  site  symmetry  of  Mn„  is  D4*  with  two  nitrogens  as 
nearest  neighbours.  The  3d-levels  of  Mn„,  in  this  strong  crystal  field,  split  into  a  ,  eg,  blg 
and  sublevels  as  shown  in  figure  4.  Due  to  the  large  crystal  field  splitting,  Hund’s  rule 
is  not  strictly  valid  and  the  magnetic  moment  is  reduced.  Further  the  p-level  of  nitrogen 
is  in  resonance  with  the  deg  level  of  Mnn  resulting  in  the  expansion  of  the  unpaired 
electron  density. 

The  experimental  observations  are  thus  in  qualitative  agreement  with  the  theoretical 
model.  The  expanded  moment  density  of  Mn„  also  implies  an  overlap  of  the  unpaired 
electron  distributions  of  Mn,  and  Mn„,  favouring  a  ferrimagnetic  alignment. 

3.2  Fe^N 

The  unpolarised  neutron  diffraction  pattern  of  Fe4N  was  recorded  using  neutrons  of 
wavelength  107  A.  The  intensities  of  various  reflections  were  measured  by  applying  a 
magnetic  field  parallel  to  their  respective  scattering  vectors.  The  magnetic  structure, 
obtained  using  this  data,  gave  a  ferromagnetic  alignment  with  magnetic  moments  of 
2-73  nB  and  1  -77  pB  for  Fej  and  Fe„  atoms.  This  is  in  agreement  with  the  earlier  neutron 
results  on  Fe4N  (Frazer  1958). 


Figure  5.  Magnetic  form  factor  of  face-centred  Fe  atom  in  Fe4N.  Unpolarised  neutron  data 
(open  circles)  gives  an  indication  of  the  results  discussed  in  the  text. 


316 


N  S  Satya  Murthy  et  al 


The  polarised  neutron  diffraction  data  on  Fe4N  was  analysed  on  lines  similar  to 
those  of  Mn4N.  The  magnetic  form  factor  for  the  Fe„  atom  has  been  obtained  from  the 
intensities  using  Fe+  (Watson  and  Freeman  1961)  form  factor  for  Fej  and  is  shown  in 
figure  5.  The  calculated  Fe+  and  Fe+  +  form  factors  (Watson  and  Freeman  1961)  are 
also  shown  for  comparison. 

The  form  factor  for  Fe„  is  significantly  expanded  as  compared  to  that  of  the  corner 
Fe,  atom  implying  a  contracted  moment  density  for  the  Fen  atoms.  The  unpaired 
electron  densities  of  Fe,  and  Fe„  are  thus  well  localised  and  do  not  overlap  leading  to  a 
ferromagnetic  ordering  between  Fe,and  Fe„.  The  reduction  in  the  moment  is  probably 
due  to  the  strong  crystal  field  effects  at  the  Fe„  sites  modifying  the  electron  density 
contributing  to  the  magnetic  moment. 


4.  Conclusion 

The  polarised  neutron  diffraction  data  analysed  on  the  basis  of  different  form  factors 
for  the  two  nonequivalent  sites  thus  give  a  significant  difference  in  the  unpaired  electron 
densities  at  the  corner  and  the  face-centred  atoms.  The  experimentally  determined 
moment  densities  have  qualitatively  been  explained  on  the  basis  of  an  existing 
theoretical  model. 
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Melting  in  two  dimensions — the  current  status 
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Abstract.  The  current  status  of  the  controversy  relating  to  melting  in  two  dimensions  is 
surveyed.  To  begin  with,  a  review  is  given  of  the  seminal  work  of  Kosterlitz  and  Thouless.  This 
is  followed  by  a  discussion  of  the  modifications  introduced  by  Nelson  and  Halperin.  The 
search  for  the  continuous  transitions  and  the  intermediate  hexatic  phase  predicted  by  these 
theories  is  then  described,  covering  both  the  laboratory  as  well  as  simulation  experiments. 
Alternate  viewpoints  to  the  kt  theory  aired  recently  in  the  literature  are  also  briefly  examined. 
The  paper  concludes  with  an  outlook  for  the  future. 

Keywords.  Two-dimensional  melting;  computer  simulation;  Kosterlitz-Thouless  theory; 
grain-boundary  mechanism. 
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1.  Introduction 

As  is  well  known,  melting  is  a  first-order  transition.  This  of  course  refers  to  three- 
dimensional  solids,  but  then  we  do  live  in  a  three-dimensional  world.  Be  that  as  it  may,  it 
is  interesting  to  speculate  on  the  nature  of  melting  of  lower  dimensional  solids.  The 
question  is  not  purely  academic  since  many  solids  actually  behave  as  if  they  were  of 
lower  dimensionality. 

Our  attention  in  this  paper  will  be  directed  to  the  melting  of  2D  solids,  a  topic  which 
during  the  past  decade  has  been  the  subject  of  active  research  as  well  as  of  lively 
controversy  (as  yet  unsettled).  After  discussing  why  2D  melting  is  of  interest  per  se ,  we 
will  explain  what  the  controversy  is  all  about.  We  will  then  make  a  broad  survey  of  the 
various  attempts  to  shed  light  on  the  controversy,  and  wind  up  with  an  outlook  for  the 
future.  Parenthetically  we  remark  that  although  our  focus  is  restricted  to  2D  melting, 
this  topic  actually  forms  a  part  of  the  general  area  of  phase  transitions  in  2-dimensions, 
a  subject  with  considerable  implications  far  beyond  condensed  matter  physics. 


2.  Some  preliminaries 

It  is  pertinent  as  well  as  useful  to  commence  our  discussion  with  a  few  general  remarks 
on  order  in  lower-dimensional  systems.  Nearly  fifty  years  ago,  Peierls  (1935)  and 
Landau  (1937)  demonstrated  that  long-range  order  is  impossible  in  a  2D  solid  in  the 
sense  that  <  (r  —  r)2  ),  the  mean-square  fluctuation  of  the  distance  r  between  two  atoms 
separated  by  average  distance  r,  increases  logarithmically  with  the  size  of  the  system,  in 
contrast  to  the  finite  value  one  obtains  for  a  three-dimensional  solid.  In  other  words, 
crystalline  order  is  rendered  impossible  due  to  fluctuations.  One  can  understand  this 
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intuitively  by  considering  the  number  of  nearest  neighbours  in  a  simple  cubic,  a  square 
and  a  one-dimensional  lattice.  Whereas  in  the  three-dimensional  lattice  there  are  6 
nearest  neighbours  to  “hold  back”  every  atom,  there  is  less  “check”  in  lower 
dimensional  systems,  facilitating  the  disruption  of  order.  Slightly  more  formally,  if 
<  u2{R)  )  denotes  the  mean  square  fluctuation  of  atom  at  R,  then  from  a  consideration 
of  the  thermal  agitation  caused  by  phonons  it  is  found  (Peierls  1935)  that 


This  integral  converges  in  three  dimensions  but  diverges  in  lower  dimensions,  implying 
the  absence  of  long-range  order.  In  physical  terms,  long  wavelength  phonons  prevent 
the  solid  from  fully  attaining  a  crystalline  structure. 

While  the  above  line  of  reasoning  is  no  doubt  persuasive,  one  might  legitimately 
wonder  why  it  is  that  the  famous  two-dimensional  Ising  model  does  not  fall  into  this 
slot  but  shows  long-range  ordering  instead.  Indeed,  stimulated  by  such  a  line  of 
thought,  there  were  several  studies  in  the  sixties  concerning  the  nature  of  order  in 
various  2D  systems.  Mermin  and  Wagner  (1966)  for  example,  proved  that  there  is  no 
spontaneous  magnetization  in  a  2D  magnet  with  spins  having  more  than  one  degree  of 
freedom  (-for  comparison,  the  Ising  spin  has  only  one  degree  of  freedom).  Shortly 
thereafter,  Hohenberg  (1967)  demonstrated  the  absence  of  long-range  order  in  a  2D 
Bose  fluid. 

Even  as  evidence  began  to  accumulate  concerning  the  absence  of  long-range  order 
(lro)  in  several  2D  systems,  Stanley  and  Kaplan  (1966)  discovered  by  analyzing  the 
high -temperature  series  expansion  that  the  susceptibility  of  the  2D  planar  magnet  (the 
so-called  XY  model)  diverged  at  a  particular  temperature,  suggestive  of  a  phase 
transition.  This  was  quite  intriguing  for,  till  then,  one  was  used  to  a  phase  transition 
accompanied  by  the  onset  of  order.  Did  the  work  of  Stanley  and  Kaplan  imply  that 
phase  transitions  were  possible  without  requiring  lro?  It  was  at  this  juncture  that 
Kosterlitz  and  Thouless  (1973)  put  forth  their  famous  idea  of  phase  transitions 
accompanied  by  a  change  of  “topological  order”  rather  than  the  conventional  lro. 
Understandably,  great  excitement  was  aroused,  and  various  2D  systems  were 
experimentally  studied  to  check  out  these  novel  concepts.  While  in  2D  superfluid  a 
Kosterlitz-Thouless  (kt)  transition  was  observed  as  predicted  (Bishop  and  Reppy 
1 978),  the  results  on  melting  have  been  quite  controversial  and  it  is  these  that  we  shall  be 
dwelling  upon. 


3.  The  Kosterlitz-Thouless  theory 

3.1  Aspects  of  the  2D  XY  model 

As  a  prelude  to  the  kt  theory,  we  shall  briefly  consider  the  2D  XY  model  in  order  to 
highlight  the  essential  physical  content  of  the  kt  mechanism.  We  begin  by  labelling 
systems  using  two  integers  ( d ,  n ),  denoting  respectively  the  dimensionality  of  the  system 
and  that  of  the  order  parameter  (i.e.  the  number  of  independent  components  in  it).  In 
this  scheme,  the  Ising  model  would  be  a  (2, 1)  system,  the  Heisenberg  model 
investigated  by  Mermin  and  Wagner  (1966)  would  be  a  (2,  3)  system  and  so  on.  The 
models  of  current  interest  (including  the  XY  model)  belong  to  the  (2,  2)  family. 
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The  2D  XT  model  refers  to  a  system  of  spins  constrained  to  rotate  in  a  plane.  The 
spins  are  often  taken  to  be  organized  on  a  lattice.  However,  one  eventually  goes  to  a 
continuum  limit,  whereupon  the  lattice  structure  can  be  ignored.  Observe  that  the  spin 
vector  can  continuously  rotate  and  has  an  infinity  of  orientations  to  choose  from,  in 
contrast  to  the  Ising  spin  which  has  only  two  options.  This  leads  to  an  important 
difference  in  the  group  structure  of  the  Hamiltonian.  Whereas  the  Ising  Hamiltonian 
has  the  discrete  symmetry  Z2  (group  of  two  elements),  the  Hamiltonians  of  some  of  the 
systems  we  shall  be  discussing  all  have  the  continuous  symmetry  0(2).  Particle 
physicists  have  strong  interest  in  2D  O(n)  models  of  various  types  (Barber  1980). 

Passing  reference  must  also  be  made  to  the  Potts  model  (1952)  which  is  a 
generalization  of  the  Ising  model  wherein  the  order  parameter  can  take  q  distinct 
values.  For  q  —  2  (in  2D),  the  model  reduces  to  the  Ising  model  while  in  the  limit  q  ->  oo 
one  recovers  the  XY  model.  The  Potts  model  thus  interpolates  between  the  Z2  and  0(2) 
models. 

Returning  to  the  XY  model,  its  Hamiltonian  may  be  expressed  as 


=  -J  £  S.-S,.,  (1) 

<y> 

where  J  >  0  and  the  sum  <  ij  )  is  over-nearest  neighbour  pairs  only  (on  a  square  lattice, 
say).  Introducing  for  convenience  the  complex  notation 


\jj  ~  Sx  —  iSy  =  IS  exp (i0), 


and  taking  S  =  1, 


H  =  —  J  X  cos  (0,-0,).  (2) 

<y  > 

At  low  temperatures  where  spins  on  neighbouring  sites  are  expected  to  be  highly 
correlated,  we  may  expand  the  cosine  and  retain  only  the  quadratic  term,  obtaining 

h  =  £  I  (di-dj)2.  (3) 

z  <y> 


The  above  Hamiltonian  corresponds  to  the  spin  wave  approximation  (Wegner  1967) 
and  is  the  starting  point  for  the  calculation  of  various  physical  properties.  Of  special 
interest  is 


C(r)=  <^(OW*(r)>, 

(4) 

the  correlation  function.  Analysis  shows 

C(r)  =  r-iP)  for  r -►  o o, 

(4a) 

where 

rj(T)  =  k^T/lnJ. 

(4b) 

This  result  confirms  the  absence  of  lro,  but  its  form  is  unexpected. 

To  appreciate  this  last  remark,  let  us  briefly  recall  the  corresponding  result  for  the 
Ising  model  (see  figure  la).  In  the  disordered  phase,  correlations  decay  exponentially 
but  precisely  at  the  critical  point,  they  have  a  power  law  behaviour  (Stanley  1971).  By 
contrast,  the  spin  wave  approximation  to  the  XY  model  exhibits  a  power  law  behaviour 
at  all  temperatures  (see  figure  lb).  KT  proposed  that  if  one  goes  beyond  the  spin  wave 
approximation  and  includes  appropriate  topological  defects,  the  line  of  critical  points 
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exp(-r/^  +  ) 
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(b) 


r-'TUT) 


exp(-r/^+) 


Tc 

Figure  1 .  Schematic  summary  of  the  phase  transitions  of  the  2D  Ising  (a)  and  X  Ymodels  (b), 
(c).  In  (b)  is  shown  the  X  Ymodel  in  the  spin  wave  approximation  where  every  temperature  is  a 
critical  temperature.  When  vortices  are  included,  the  line  of  critical  points  is  terminated  at  Tc  as 
shown  in  (c),  with  a  transition  in  the  correlation  function  (After  Young  1980.) 


would  terminate  at  a  finite  temperature  Tc  (see  figure  lc).  Below  that  temperature  there 
is  power  law  behaviour  while  above  there  is  exponential  decay.  The  transition  is  thus 
characterized  by  changes  in  the  correlation  function  rather  than  by  the  appearance  or 
disappearance  of  lro.  As  we  shall  presently  see,  these  changes  are  linked  to  the 
topological  order  in  the  system. 


3.2  Role  of  topological  defects 


How  do  topological  defects  alter  the  character  of  the  system  at  Tc  as  sketched  in 
figure  lc?  To  answer  this  question,  consider  the  vortex  in  figure  2a.  This  and  its 
companion  in  figure  2b  are  outside  the  purview  of  the  spin  wave  approximation. 
Observe  that  either  of  these  vortices  is  capable  of  strongly  disturbing  spins  far  from  the 
region  of  the  singularity.  By  contrast,  in  the  spin  wave  approximation  there  are  no  such 
long-range  perturbing  agencies  (see  figure  2c).  However,  with  or  without  vortices,  C(r) 
always  decays  as  r  -*>  oo  i.e.  there  is  no  lro. 

Vortices  are  like  charges,  and  can  be  labelled  with  integers  N  positive  and  negative. 
Those  in  figure  2  are  the  simplest,  with  \  N\  =  1 .  The  energy  of  an  isolated  vortex  may  be 
calculated  from  (2)  by  using  the  continuum  approximation,  and  is  found  to  be 
(Kosterlitz  and  Thouless  1973) 

J 


CR 


E  (vortex)  —  Ec  + 


27trdr  —  |V0|‘ 


ao 


Ec  +  nJ  In  {R/a0). 


(5) 


Here  R  is  the  radius  of  the  system  and  a0  is  a  distance  of  atomic  dimensions 
characterizing  the  core.  Ec  is  the  core  energy  associated  with  the  region  r  <  a0.  Notice 
that  the  energy  of  an  isolated  vortex  tends  to  infinity  if  the  radius  of  the  system  goes  to 
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Figure  2.  Some  possible  spin  arrangements  of  the  2D  XY  system,  (a)  and  (b)  depict  the 
situation  in  the  presence  of  vortices,  and  clearly  there  is  a  substantial  disturbance.  By  contrast, 
there  is  less  disturbance  in  (c)  which  illustrates  the  spin  wave  approximation. 


infinity,  implying  that  it  would  be  very  difficult  to  create  a  vortex  in  the  thermodynamic 
limit.  As  we  shall  presently  see,  this  statement  will  require  some  qualification. 

Next  consider  the  vortex-antivortex  pair  in  figure  3.  On  account  of  a  mutual 
cancellation  effect,  the  spins  at  large  distances  are  no  longer  severely  disturbed,  the 
disturbance  being  confined  to  the  neighbourhood  of  the  pair.  As  may  be  expected,  the 
energy  of  the  pair  is  finite,  being  given  by 

E  (pair)  =  2 Ec  +  2nJ  In  (r/a0),  (6) 

where  r  is  the  separation. 

Going  back  to  the  single  vortex,  although  its  creation  is  energy-expensive,  there  is  an 
entropy  advantage  associated  with  the  fact  that  one  can  locate  the  vortex  in  various 
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Figure  3.  Vortex-antivortex  pair  in  the  2D  X  Y  system. 


ways.  This  entropy  is  2k  B  In  ( R/a0 )  so  that  the  free  energy  is 
F  =  E  —TS 

=  (nJ  —  2kBT)  In  (R/a0),  (7) 

implying  that  for  T  >TC  where 

Tc  =  nJ/2kB,  (8) 

free  vortices  may  indeed  exist  even  in  the  therfhodynamic  limit. 

Based  on  the  above  arguments,  kt  proposed  the  following  scenario:  At  low 
temperatures,  only  vortex  pairs  can  exist.  These  are  thermally  generated  and  ride  over  a 
spin  wave  background.  Since  (bound)  pairs  do  not  cause  serious  perturbations  at  large 
distances  (see  figure  3),  C  (r)  is  essentially  that  given  by  the  spin  wave  approximation. 
Above  rc,  the  pairs  dissociate  since  the  system  can  now  support  free  vortices.  Technical 
considerations  (to  be  amplified  later)  demand  that  the  number  of  vortices  be  small. 
Even  so,  they  drastically  modify  C  (r)  (-see  also  figure  2).  As  already  noted,  with  such 
strong  fluctuations  present,  the  spin  wave  approximation  does  not  apply.  Further,  like 
their  counterparts  in  solids  namely  dislocations,  vortices  can  move  freely  when 
subjected  to  suitable  forces.  The  system  as  a  whole  therefore  acquires  flow  properties 
and  exhibits  a  concomitant  loss  of  (spin)  rigidity  at  Tc.  This  then  is  the  central  feature  of 
the  transition,  the  unbinding  of  pairs  at  Tc  and  the  related  loss  of  the  corresponding 
rigidity. 

3.3  Topological  order 

Though  conventional  lro  is  absent,  kt  suggest  that  one  may  differentiate  between  the 
phases  on  either  side  of  Tc  via  the  concept  of  topological  order.  Briefly,  topological 
order  is  an  assessment  of  whether  the  topological  defects  are  bound  (“ordered”)  or 
unbound  (“disordered”).  To  make  such  an  assessment,  one  makes  a  large  closed  circuit 
in  the  system  and  measures  the  phase  angle  all  along  the  boundary.  The  total  change  of 
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the  phase  angle  will  be  determined  by  the  number  and  strength  of  the  singular  points 
enclosed  by  the  path.  In  the  high  temperature  phase  (figure  4a),  there  will  be  a 
preponderance  of  isolated  singularities.  The  number  enclosed  will  be  proportional  to 
the  area  A  of  the  contour  C  and  the  average  total  phase  change  will  be  proportional  to 
the  length  L  of  C.  In  the  low-temperature  phase  (figure  4b),  the  average  total  phase 
change  will  be  determined  by  the  number  of  pairs  cut  by  the  path,  and  will  be 
proportional  to  (L d)i/2  where  d  is  the  mean  separation  of  the  pairs.  Based  on  this 
criterion,  one  can  recognize  a  change  in  the  topological  order  at  Tc.  It  is  worth  noting 
that  a  similar  assessment  by  traversing  a  closed  contour  is  also  made  in  lattice  gauge 
theories  (Kogut  1979). 

3.4  2D  melting — The  KT  view 

We  turn  now  to  melting  in  two  dimensions.  Analogous  to  the  spin  wave  approximation, 
we  have  the  harmonic  (phonon)  approximation.  Introducing 

Pg(R)  —  exp  (iG  •  R),  (9) 

where  G  denotes  a  reciprocal  lattice  vector,  we  may  as  usual  define  the  correlation 
function 

Cc(R)=  <PG(R)PG*(0)>,  (10) 

which,  for  the  2D  solid,  varies  as 

lim  Cg(R)~  R-*g(t\  (11) 

R  -*  oo 


Figure  4.  Assessment  of  topological  order  is  made  by  going  round  a  simple  closed  contour 
and  measuring  the  overall  change  in  phase  angle.  As  discussed  in  text,  the  result  depends  on 
whether  the  topological  defects  are  unpaired  (as  in  (a))  or  are  paired  (as  in  (b)). 
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One  can  now  calculate  the  structure  factor 


S(q)=  <|p(q)|2  >, 


whereupon  one  finds  that  the  familiar  <5-function  Bragg  singularities  i.e 


(12) 


S(q)~<5(q-G) 


become  “smeared”  into  power  law  singularities  (Jancovici  1967)  i.e.. 


S(  q)~ 


q  —  G|  -2+*cen 


(13) 


One  therefore  has  a  “quasi”  crystal  which,  incidentally,  is  the  reason  why  one  may  still 
label  the  diffraction  peaks  with  the  set  { G }.  Well-defined  peaks  in  S  (q)  can  be  observed 
provided  rjG<  2. 

Now  dislocation-mediated  melting  is  really  an  old  concept  in  three  dimensions,  going 
back  to  the  thirties  (in  a  sense  to  Mott  and  Gurney  1939;  for  a  review,  see  Cotterill 
1980).  With  respect  to  two  dimensions*,  kt  note  that  at  low  temperatures,  the  system, 
though  lacking  crystallinity  (see  equation  (13)),  is  a  solid  in  that  it  is  able  to  resist 
external  shear.  Besides  phonons,  the  system  also  supports  bound  dislocation  pairs 
which  are  thermally  generated.  The  latter  neither  upset  the  power  law  decay  of  CG(r) 
(see  equation  (11))  nor  the  rigidity.  Above  Tm  the  melting  temperature  the  pairs  break, 
and  once  free  dislocations  appear  the  system  loses  (elastic)  rigidity,  signalling  a 
transition  to  the  fluid  state.  The  topological  order  can  be  assessed  as  earlier  i.e.  by 
making  the  Burgers  circuit  and  measuring  the  failure  in  the  path  closure. 

Regarding  the  details  of  the  transition,  one  clearly  needs  a  more  refined  analysis  than 
contained  in  equations  (5)— (8).  The  crucial  feature  in  such  an  analysis  is  the  screening  of 
the  pair  interaction  (6).  The  underlying  physics  is  better  appreciated  by  considering  the 
analogous  case  of  a  2D  gas  of  charged  particles1  with  charges  ±  q ,  electrically  neutral 
on  the  whole,  and  interacting  via  the  logarithmic  potential 


tf(|ri-ij|)  =  -2qtqj  In 


r*  ~  r; 


+  2/x,  r  >  a 


o 


r  <  a0. 


(14) 


Here  qt  and  are  the  charge  and  position  of  the  ith  particle.  2/x  is  the  energy  required  to 
create  a  pair  of  particles  of  equal  and  opposite  charge  a  distance  a0  apart.  In  2D  solids, 
the  analogue  of  is  the  dislocation  core  energy  Ec  (-see  also  equation  (5)).  The  quantity 
a0  is  an  appropriate  cut-off  of  the  order  of  the  particle  diameter  or,  for  a  lattice,  the 
lattice  spacing.  One  expects  that  due  to  (14),  opposite  charges  can  link  to  form  bound 
pairs,  i.e  dipoles.  For  these  pairs  to  unbind,  U(r)  must  decrease,  which  can  happen  by 
screening. 

Focusing  attention  on  a  particular  dipole  of  size  r,  the  interaction  between  the  two 
members  constituting  the  dipole  will  be  screened  by  (smaller)  pairs  lying  within  the 


*  For  historical  completeness  it  must  be  remarked  that  Feynman  had  independently  proposed  the  idea  of 
dislocation-assisted  melting  in  two  dimensions.  Unfortunately,  this  work  is  unpublished.  An  outline  of  this 
theory  is  given  by  Elgin  and  Goodstein  (1974)  and  by  Dash  (1978). 

+  By  now  the  reader  would  undoubtedly  have  noticed  that  there  are  a  variety  of  2D  systems.  For 
elucidating  various  features  of  the  kt  theory,  we  cite  different  examples  as  convenient! 
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range  of  the  field.  These  pairs  will  in  turn  be  screened  by  others  lying  in  their  respective 
fields  and  so  on.  One  thus  has  a  scaling  situation  as  in  the  case  of  critical  fluctuations 
present  during  a  conventional  second-order  phase  transition,  and  not  surprisingly, 
renormalization  group  techniques  make  their  entry. 

Owing  to  screening,  (14)  becomes  modified  to 


2 q2  In  (r/a0) 
6(r) 


(15) 


where  e(r)  is  the  space-dependent  dielectric  function,  and  includes  the  polarizability  of 
pairs  with  separation  less  than  r.  Introducing  p(r)  the  probability  of  finding  a  pair  with 
separation  r,  one  finds  that  e(r)  and  p{r)  are  coupled  as  follows  (Kosterlitz  1974; 
Halperin  1979): 


e(r  +  dr)  =  e(r)  +  4n  x  pol.  contributed  by  pairs  in  the 
range  r  and  (r  +  dr) 

r2 

=  e(r)  +  47i  —  27irp(r)dr, 


p(r  -f  dr)  =  p(r)  exp 


2 

rTe(r) 


dr. 


(16) 


Here  r1  /2T  is  the  polarizability  of  a  pair  with  separation  r  and  [2dr/re(r)]  is  the  work 
done  in  increasing  the  separation  of  the  pair  by  the  amount  dr.  Using  an  iterative 
procedure,  one  obtains  from  (16)  the  famous  Kosterlitz  recursion  relations  (Kosterlitz 
1974).  Such  relations  can  be  written  down  for  every  (2,  2)  system  i.e.  the  JKYmodel,  2D 
crystal  etc. 

We  turn  to  the  outcome  of  the  recursive  relations  in  the  specific  case  of  2D  crystal. 
Two  crucial  quantities  are: 


(i) 

(ii) 


y  =  exp  (-EJkJT),  and 

4p(^  +  A)  o2 
(2g  +  X)  kj' 


(17) 


where  p  and  X  are  the  Lame  constants  and  a  the  lattice  spacing.  During  the 
renormalization  calculations,  one  has  to  rescale  the  dislocation  core  diameter  from  a0 
to  a0  exp  /  causing  y  and  K  to  become  /-dependent.  The  recursive  relations  are  of  the 
form  (Kosterlitz  1974;  Nelson  1978;  Nelson  and  Halperin  1979;  Young  1979): 

dKv7-f)=/2(y(0,  *('))•  (18) 


The  solution  of  these  equations  is  too  intricate  to  be  discussed  here.  Of  greater 
interest  are  the  renormalization  group  flows  illustrated  in  figure  5.  The  different 
trajectories  effectively  determine  the  system  behaviour  at  different  temperatures. 

Let  us  follow  a  typical  trajectory  for  T  <Tm  from  the  starting  line  (see  figure  5).  The 
flow  takes  us  to  the  abscissa  where  y(l)  is  zero  and  K(l)  has  a  finite  value,  which  implies 
that  (i)  the  probability  of  finding  a  free  dislocation  is  zero  and  (ii)  the  renormalized 
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Figure  5.  Renormalization  group  flow  in  the  kt  theory.  The  quantities  K  and  y  are  defined 
in  (17).  The  line  forT  =  Tm  is  the  separatrix.  The  dislocation  probability  y(l)  tends  to  zero  in  the 
region  to  the  left  of  the  incoming  separatrix  which  is  the  crystalline  phase.  The  trajectories  to 
the  right  flow  away  to  large  y-values,  signalling  instability  and  the  dissociation  of  dislocation 
pairs. 


stiffness  is  finite.  The  trajectory  for  T  —  Tm  is  the  separatrix,  and  trajectories  to  its  right 
flow  away,  implying  that  for  T  >  Tm  (i)  free  dislocations  can  be  found  and  (ii)  the  system 
loses  rigidity  since  £(/)-►  0.  The  system  behaviour  close  to  Tm  may  be  carefully 
examined  by  zooming  near  the  region  where  the  separatrix  curve  meets  the  abscissa. 
The  principal  findings  are: 

(i)  CG(R)~R-ic(T\  T^T~,  (19) 


where  riG(T)  is  related  to  the  renormalized  Lame  constants  nR(T)  and  ar(T )  by 


riG(T)  =  k^T\G 


(3fxR  +  2;;) 

^  R{/.[i  R  ~l~  2^) 


(20) 


At  Tm,  the  exponent  for  the  first  Bragg  point  for  a  triangular  lattice  cannot  exceed  (1/3). 

R 


(ii) 


Cg(R)  -  exp 


T-+T 


the  correlation  length  £+  diverging  according  to 

const 


m  ? 


(21) 


~  exp 


v  =  0-36963. 


C T-rmy 

(iii)  The  renormalized  stiffness  constant  KR(T)  (see  equation  (14))  varies  as 


(22) 


KRl(T-+Tm) 


1 


1671 


(i-c  tn 


(23) 


where  C  is  a  positive,  nonuniversal  constant  and  t  =  ( T  —  Tm)ITm.  AboveTm  the  stiffness 
vanishes;  KR  therefore  abruptly  jumps  to  zero  from  the  value  \6n  at  T  =  Tm. 

(iv)  The  specific  heat  has  only  an  essential  singularity.  Physically  this  is  related  to  the 
fact  that  at  Tm  the  density  of  dislocation  is  rather  low.  There  is  therefore  very  little 
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entropy  associated  with  it.  However  above  Tm  there  is  a  small  bump  (Berker  and  Nelson 
1979). 

Next  to  the  order  parameter,  the  quantities  of  interest  in  a  conventional  phase 
transition  are  those  which  exhibit  singularities.  In  the  case  of  the  kt  transition,  there  is 
no  ordering  in  the  conventional  sense.  The  specific  heat  also  is  not  very  informative.  The 
only  meaningful  signatures  are  the  translational  correlations  CG(R)  on  either  side  of  Tm 
(see  (19)  and  (21)),  and  the  temperature  dependence  of  the  stiffness  (see  (23)),  all  of 
which  are  conveniently  studied  via  x-ray  and  neutron  scattering. 

A  few  other  related  points  must  also  be  noted  at  this  juncture.  Firstly,  the  calculation 
of  the  screening  via  (16)  is  a  self-consistent  one,  and  for  this  reason  the  kt  theory  is 
essentially  a  mean  field  type  of  theory.  Secondly,  while  solving  the  recursive  equations, 
the  approximation  is  made  that  y  is  small  ( i.e  defect  density  is  small;  see  also  remark  (iv) 
above),  implying  that  the  core  energy  Ec  is  large  (see  (17)).  So  the  picture  ( T  >Tm)  is  that 
of  a  dilute  system  of  dislocations  floating  about  in  an  otherwise  harmonic  solid.  Should 
the  cores  overlap,  the  theory  collapses.  In  a  superfluid,  the  vortex  core  size  is  ~  1  A  and 
overlap  is  not  a  serious  problem.  On  the  other  hand,  the  core  size  of  a  dislocation  could 
be  several  lattice  spacings,  and  overlap  becomes  a  real  possibility  (Kosterlitz  1980).  One 
must  therefore  be  ready  for  surprises.  We  shall  revert  to  this  question  later.  Finally,  the 
transition  is  continuous;  note  especially  the  divergence  of  the  correlation  length 
(equation  (22)),  reminiscent  of  a  conventional  second-order  phase  transition. 
(However,  a  significant  difference  is  that  {  has  an  exponential  rather  than  a  power  law 
behaviour,  Stanley  1971.) 


3.4  Role  of  orientational  correlations 

About  five  years  after  kt  advanced  their  theory,  Halperin  and  Nelson  (1978;  see  also 
Nelson  and  Halperin  1979)  drew  attention  to  an  important  aspect  of  the  problem  not 
considered  by  KT*.  Nelson  and  Halperin  (nh)  argued  that  besides  translational 
correlations,  one  must  also  consider  orientational  correlations,  and  that  when  a  solid 
melts  to  become  an  isotropic  liquid,  it  sheds  both  these  correlations,  nh  therefore 
introduce  an  orientational  order  parameter  i jj.  Most  of  the  focus  so  far  has  been  on 
triangular  lattices  which  are  the  most  close  packed  structures  in  two  dimensions.  For 
this  lattice,  the  bond  orientational  parameter  is  given  by 

iA(r)  =  exp[i60(r)],  (24) 


where  0(r)  is  the  orientation  relative  to  the  bond  between  nearest  neighbour  atoms.  6  is 
related  to  the  displacement  field  u(r)  by 


1  ( duy(r)  cujr)\ 
e(r,  =  2 


(25) 


Analogous  to  (10),  one  now  has  a  orientational  correlation  function 

C6(r)  =  <Wr(r)>.  (26) 


Extending  attention  to  orientational  effects  requires  one  now  to  consider  new 


*  Taking  into  account  a  related  paper  by  Young  (1979),  the  melting  theory  is  often  referred  to  as  kthny 
theory. 
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defects,  namely  disclinations.  These  are  “rotational  singularities”,  two  examples  of 
which  are  illustrated  in  figure  6.  They  are  formed  by  taking  a  perfect  triangular  lattice 
and  either  removing  or  adding  a  wedge.  The  curvatures  resulting  thereof  are  significant 
(see,  for  example,  Nelson  1983a)  but  we  shall  not  consider  them  here.  The  interesting 
point  about  disclinations  is  that  a  dislocation  can  be  viewed  as  resulting  from  the 
binding  of  a  disclination  pair,  as  illustrated  in  figure  6.  In  other  words,  a  dislocation  can 
be  regarded  as  a  disclination  dipole. 

Based  on  the  above  additional  concepts,  NH  sketch  a  scenario  for  melting  involving 
not  only  dislocations  but  disclinations  as  well.  At  low  temperatures,  the  dislocations  are 
paired,  and,  on  account  of  the  internal  structure  of  the  dislocation  (see  figure  6),  the 
dislocation  pair  can  be  viewed  as  a  disclination  quadrupole.  The  harmonic  phonons  as 
usual  disrupt  positional  lro  leading  to  a  power  law  behaviour  for  the  translational 
correlations  as  in  (1 1),  but,  by  contrast,  the  orientations  exhibit  lro  i.e ., 

lim  C6(r)  =  |<i/i  >|,  (27) 

r  -*  oo 

a  constant  which  is  temperature-dependent. 


Figure  6.  (a)  and  (b)  illustrate  respectively  five-  and  seven-  fold  rotational  singularities  (i.e. 
disclinations).  For  convenience,  the  atoms  are  suppressed  and  only  the  surrounding  Wigner- 
Seitz  cells  are  shown,  (c)  illustrates  how  a  pair  of  disclinations  can  combine  to  form  a 
dislocation.  Here  the  atoms  are  shown  along  with  their  W-S  cells. 
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As  the  temperature  is  gradually  increased,  a  stage  is  reached  ( T —  Tm )  where 
dislocations  unbind.  This,  however,  does  not  produce  an  isotropic  liquid;  instead  the 
system  enters  an  intermediate  phase  in  which,  CG(r)  decays  exponentially  as  in  (21)  but 
rotational  correlations,  though  asymptotically  decaying,  are  still  strong,  being  given  by 

C6(r)  ~  T^T:,  (28) 

where 

rt6(T)=18kBT/nKA(T).  (29) 

The  quantity  KA(T)  called  the  Frank  constant  is  related  to  the  so-called  bending  energy 
of  orientational  fluctuations  as 


d  2r(V6)2. 


(30) 


In  effect,  KA  is  a  measure  of  the  “angular  stiffness’  of  the  system.  Below  Tm,KA  —  oo 
indicative  of  perfect  angular  correlations  but  above  Tm ,  there  is  a  temperature 
dependence  given  by 

KA(T)~iUn  r-r:,  (31) 

where  £  +  (T)  is  the  same  as  in  (22).  This  divergence  is  of  course  linked  to  the  appearance 
of  long  range  orientational  order  below  Tm  (see  (27)). 

The  phase  with  properties  as  in  (21),  (28)  and  (31)  is  like  a  liquid  crystal  in  that  it  has 
greater  orientational  order  (power  law  decay)  than  translational  order  (exponential 
decay).  This  phase  is  frequently  referred  to  as  a  hexatic  phase. 

When  heated,  the  hexatic  phase  in  turn  undergoes  a  transition  at  a  temperature  Tiy 
becoming  an  isotropic  liquid.  This  occurs  via  the  unbinding  of  the  disclination  pair, 
causing  a  loss  of  the  orientational  correlations  present  in  the  hexatic  phase.  Thus,  in  the 
NH  picture,  the  free  topological  defects  in  the  isotropic  liquid  are  not  the  dislocations 
but  the  disclinations.  As  noted  by  nh,  the  transition  at  Tt  belongs  to  the  same 
universality  class  as  the  two-dimensional  superfluid  and  the  XT  model,  whence, 

t]f>  — ►  if  as  T  — >  T i  .  (32) 


The  renormalized  Frank  constant  jumps  discontinuously  to  zero  at  T{  from  the  value 
12kBT/n  to  which  it  decreases  as  T  ->T[ .  Above  Th 


with 


C6(r)  ~  exp (  — r/{6 (T)) 


{6(T)~ex  p 


b 

(T-Tif12 


(33) 

(34) 


As  usual,  there  is  only  an  essential  singularity  in  the  specific  heat.  In  short,  at  T{  there  is 
again  a  crossover  from  a  power  law  to  exponential  decay,  but  this  time,  of  the 
orientational  correlation  function.  Accompanying  this  change  is  a  vanishing  of  the 
orientational  stiffness. 

Figure  7  paraphrases  the  features  of  the  original  kt  theory  and  the  subsequent  nh 
modification.  In  passing  we  note  that  although  Halperin  and  Nelson  (1978)  also 
predicted  continuous  transitions,  they  were  cautious  enough  to  observe  that  they 
“cannot  rule  out  other  mechanisms  for  melting,  perhaps  leading  to  a  first-order 
transition”. 


330 


G  Venkataraman  and  D  Sahoo 


r-\<T> 


exp(-r/g) 


m 


r-\(T) 

K1»l 


lexp{—  r/g )!: 
I'r'-TigCT)  !| 
HEXATIC 


exp  (-r/g6) 


m 


* 


Figure  7.  Comparison  of  the  transitions  predicted  by  the  original  kt  theory  and  by  the 
subsequent  nh  modification.  For  explanation,  see  text. 


Laboratory  experiments  on  2D  melting  are  frequently  performed  on  layer  systems 
supported  by  a  substrate,  nh  have  examined  the  role  played  by  the  substrate  by  adding 
an  orienting  field  to  the  defect  Hamiltonian,  and  find  the  following:  If  the  substrate  can 
be  regarded  as  smooth,  then  the  considerations  of  figure  7  continue  to  remain  valid.  On 
the  other  hand,  if  the  periodic  nature  of  the  perturbing  potential  from  the  substrate  is 
significant,  then  one  must  examine  whether  the  periodicity  of  the  substrate  is 
commensurate  or  not  with  that  of  the  layer  above  (see  also  figure  8).  Of  interest  is  the 
case  where  the  substrate  potential  is  weak  and  incommensurate.  In  this  situation,  there 
is  only  one  transition,  namely,  that  at  Tm ;  the  other  one  at  Tt  is  washed  out  (for  a 
triangular  lattice). 

A  second  question  relates  to  phase  diagrams.  In  condensed  matter,  we  are  used  to 
diagrams  as  in  figures  9a  and  9b.  One  would  like  to  know  how  these  are  modified  in  the 
light  of  the  kthny  theory.  The  outcome  is  sketched  in  figure  9c  and  9d,  and  will  become 
relevant  later. 

The  prelude  has  been  somewhat  lengthy  but  is  indicative  of  the  fact  that  2D  systems 
are  a  theorist’s  paradise!  As  we  have  seen,  topological  defects  in  2D  are  point  defects.  It 
turns  out  that  the  statistical  mechanics  of  these  is  a  lot  simpler  than  that  of  line  defects 
(with  which  one  has  to  deal  in  3D).  Hence  the  great  popularity  of  2D  systems  with 
theoreticians.  We  turn  now  to  the  various  attempts  made  to  check  the  kthny  theory. 
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4.  Experimental  studies 

4.1  Laboratory  experiments 

There  are  a  number  of  candidate  systems  like  liquid  crystal  films,  electron  films  on 
liquid  helium,  etc  which  are  suitable  for  experimentally  checking  out  the  kthny  theory. 
Many  of  these  have  indeed  been  investigated  (Sinha  1980)  but  the  most  detailed  studies 
so  far  have  been  on  rare  gas  films  absorbed  on  graphite. 

The  basal  i.e  (0001)  plane  of  graphite  is  a  good  substrate  for  forming  monolayers  by 
physisorption  since  it  presents  a  relatively  smooth  potential  to  the  layer  absorbed. 
Unfortunately,  large  single  crystals  of  graphite  are  not  available  but  this  has  not 
deterred  experimentalists.  Instead  they  have  skilfully  exploited  various  exfoliated 
graphite  products.  Basically  all  of  these  are  built  up  of  small  flakes  a  few  pm  broad  and 
~  100-300  A  thick.  The  useful  size  is  characterized  by  the  coherence  length  L  (see 
figure  10).  The  c  axes  of  these  flakes  are  all  partially  oriented  about  the  normal  to  the 
sheet  but  the  azimuthal  orientation  about  the  c  axis  is  random.  Various  types  of 
exfoliated  graphite  like  Grafoil,  ucar-xyz  and  Papyex  are  now  commercially  available. 
Very  popular  (especially  in  the  past),  Grafoil  has  L  ~  200  A,  A 9  ~  25°  (see  figure  10) 
and  a  specific  area  for  adsorption  ~  30  m2/g.  The  samples  (for  scattering  experiments) 
typically  consist  of  a  pile  of  discs  cut  from  sheets  of  exfoliated  material.  Samples  with 
several  grams  of  Grafoil  are  not  uncommon.  Very  recently,  “single  crystals”  of 
exfoliated  graphite  have  been  prepared  (Clarke  et  ai  to  be  published;  quoted  in 
Rosenbaum  et  al  (1983))  with  L  ~  400  A  and  AO  ~  3°. 

Although  notionally  the  graphite  substrate  may  be  deemed  to  exert  only  a  weak 
influence,  the  role  of  the  substrate  cannot  be  entirely  overlooked.  As  already  noted  in 
figure  8,  the  adsorbed  layer  could  be  commensurate*  as  well  as  incommensurate  with 
the  substrate.  Our  interest  is  in  the  latter  situation. 


Figure  10.  Schematic  drawing  of  the  Grafoil  sheet.  L  indicates  the  typical  coherence  length 
of  the  adsorbed  layers  while  P(6)  describes  the  distribution  arising  from  the  non-parallelism  of 
the  substrate  surfaces  (After  Nielsen  et  al  1980). 


*  It  is  interesting  that  if  the  adsorbed  atoms  are  in  registry  with  the  substrate,  <  u2  >  does  not  diverge 
(Kosterlitz  1980). 
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Monolayers  of  Ar,  Kr  and  Xe  on  graphite  have  been  studied  using  both  neutron  and 
x-ray  scattering  (Thorel  et  al  1976;  Taub  et  al  1977;  Horn  et  al  1978;  Stephens  et  al  1979; 
Hammond  et  al  1980;  Heiney  et  al  1982;  McTague  et  al  1982;  Heiney  et  al  1983)  and  the 
opinion  has  emerged  that  Xe  on  graphite  is  by  far  the  best  representative  of  2D  field- 
free  system  (McTague  et  al  1982).  Accordingly  we  shall  examine  the  recent  high 
resolution  results  of  Heiney  et  al  (1983)  obtained  at  the  Stanford  Synchrotron 
Radiation  Laboratory  (ssrl).  These  experiments  are  a  sequel  to  an  earlier  series  done  at 
mit  using  a  rotating  anode  source  but  with  somewhat  lower  resolution. 

Heiney  et  al  have  criss-crossed  the  phase  diagram  along  various  trajectories 
(including  constant  temperature  ones)  in  order  to  check  out  different  possibilities. 
Figure  1 1  shows  representative  profiles  observed  in  one  of  these  runs.  The  asymmetric 
lineshapes  are  due  to  excess  high  q  scattering  arising  from  substrate  azimuthal  disorder. 
Around  1 52  K  the  lineshape  changes,  going  over  to  a  Lorentzian  charateristic  of 
exponential  decay  of  the  correlation  function.  Heiney  et  al  reckon  that  a  transition 


Figure  11.  Diffraction  profiles  of  the  (1,  0)  spot  of  xenon  adsorbed  on  graphite.  The  scans 
have  been  made  in  the  narrow  region  between  151-6  K  to  160K  (After  Heiney  et  al  1983). 
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Figure  12.  At  the  top  are  shown  the  inverse  correlation  lengths  obtained  by  Loren tzian  fits 
to  the  data  of  figure  1 1 .  The  solid  lines  are  fits  to  the  kthny  form  while  the  dashed  lines  are 
power  law  fits.  At  the  bottom  is  shown  the  temperature  dependence  of  rj  (After  Heiney  et  al 
1983). 

occurs  at  152K.  The  question  whether  it  is  of  first  order  or  continuous  has  been 
examined  quite  carefully.  Firstly,  no  hysteresis  is  observed.  Secondly,  as  the  transition 
temperature  is  approached  from  below,  the  exponent  obtained  from  the  lineshape 
analysis  (Dutta  and  Sinha  1981)  has  the  trend  shown  in  figure  12.  At  the  melting 
temperature  it  is  found  that  0-27  ^  t]  (melting)  ^  042,  reasonably  consistent  with 
kthny  prediction  025  ^  rj(Tm)  ^  033.  Above  152  K  where  the  line  shape  is  consistent 
with  (22),  analysis  yields  v  =  0*4  (see  also  figure  12). 

The  evidence  cited  above  no  doubt  strongly  favours  the  kthny  theory.  Nevertheless 
Heiney  et  al  have  carefully  examined  possible  consistency  of  their  data  with  first-order 
transition.  If  the  transition  is  indeed  of  first-order  then  near  the  transition  temperature 
one  must  encounter  a  coexistence  patch.  Assuming  that  the  solid  material  is  converted 
into  a  liquid  according  to  a  lever  law,  they  have  analyzed  their  data.  From  the  values  of 
X2  obtained  they  conclude  that  the  observed  melting  is  consistent  with  a  continuous 
transition  although  a  possible  two-phase  coexistence  in  a  very  narrow  range  between 
1 51*6  and  151-95  K  cannot  be  ruled  out. 

Recently,  Rosebaum  et  al  (1983)  studied  the  melting  of  monolayer  xenon  on  the 
surface  of  exfoliated  single  crystals,  using  x-ray  scattering.  According  to  the  NH  theory, 
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in  the  solid  one  must  get  six  (1, 0)  Bragg  spots,  which,  owing  to  the  lack  of  lro,  will  have 
some  width  both  in  the  radial  as  well  as  angular  directions  with  probably  more  in  the 
former.  On  melting  into  the  hexatic  phase,  the  six  spots  still  survive*  with,  however, 
enhanced  width  in  both  directions.  As  temperature  is  increased,  the  changes  in  the 
orientational  correlations  will  be  more  rapid  than  the  corresponding  ones  in  the 
positional  correlation.  Consequently  the  angular  width  will  increase  more  rapidly 
compared  to  the  radial  width  until  at  Tx  the  spots  merge  to  form  a  liquid  ring.  Such  a 
behaviour  (see  figure  1 3)  is  in  fact  seen  in  experiments.  Rosenbaum  et  al  categorically 
rule  out  from  the  observed  lineshapes,  the  possibility  of  a  liquid-solid  coexistence.  Nor 
do  they  believe  that  the  observed  spots  could  be  due  to  an  isotropic  liquid  subjected  to  a 
substrate  field.  The  field  strength  required  would  be  three  orders  of  magnitude  too  high. 
They  are  confident  that  their  experiment  offers  evidence  for  melting  of  the  2D  solid  into 
an  orientationally  ordered  liquid. 

Besides  adsorbed  layers,  melting  of  liquid  crystal  films  also  have  been  investigated  to 
test  the  validity  of  the  kthny  theory.  An  important  feature  of  these  experiments  is  the 
use  of  freely  suspended  films,  the  preparation  of  which  is  schematically  illustrated  in 
figure  14.  While  a  monolayer  film  has  yet  to  be  realized,  films  with  as  low  as  two  layers 
have  been  successfully  used. 

Moncton  and  coworkers  (Moncton  and  Pindak  1979,  1980;  Moncton  et  al  1982) 
have  been  particularly  active  in  the  study  of  x-ray  diffraction  from  such  films.  An  early 
experiment  (Moncton  and  Pindak  1980)  on  a  material  called  40-8  (butyloxybenzylidene 
octylaniline)  seemed  to  suggest  that  the  melting  was  continuous  with  rj  at  melting  in  the 
range  specified  by  the  kthny  theory.  However,  films  thinner  than  4  layers  could  not  be 
studied  owing  to  film  rupture  problems.  Subsequently,  they  switched  to  another 
material  14S5  (Moncton  et  al  1982)  which  was  amenable  to  the  formation  of  films  with 
as  low  as  two  layers.  It  was  found  that  the  film  with  two  molecular  layers  melted  by  a 


Figure  13.  Schematic  drawing  to  illustrate  the  outcome  of  the  experiment  of  Rosenbaum  et 
al  (1983).  Sketched  here  is  the  related  variation  of  radial  and  angular  widths  of  the  (1,0) 
diffraction  spot  as  temperature  is  increased.  As  T  becomes  larger,  the  angular  width  increases 
more  rapidly  causing  the  six  spots  to  merge  into  a  ring. 


*  Technically,  in  the  hexatic  phase  the  mean  square  angular  fluctuations  {SO2  )  -*  oo  and  x-ray  scattering 
should  give  a  ring.  However,  a  weak  substrate  field  is  enough  to  restore  the  six  spots.  On  the  other  hand,  if  the 
orientational  correlations  are  not  strong  (i.e.  are  liquid  like  as  in  (33)),  then  a  very  strong  substrate  field  would 
be  needed  to  produce  the  spots. 


Table  1.  Summary  of  papers  dealing  with  computer  simulation  experiments. 
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Figure  14.  Ultra  thin  layers  of  self-supporting  liquid  crystal  films  are  prepared  by  drawing 
the  material  across  the  open  hole  in  the  glass  cover  slide.  The  hole  is  about  6x6  mm2.  Films  as 
thin  as  two  molecular  layers  can  be  prepared  (After  Brinkman  et  al  1982). 

single  abrupt  transition  with  hysteresis  and  no  critical  behaviour  i.e.  by  a  first-order 
transition.  These  results  are  also  in  accordance  with  the  findings  of  Bishop  et  al  (1982)  in 
a  related  study  in  which  shear  mechanical  measurements  were  made. 

We  have  confined  our  survey  to  the  most  important  of  the  several  experiments  done 
so  far.  While  the  studies  on  adsorbed  layers  seem  to  favour  the  kthny  theory,  the 
melting  of  liquid  crystal  films  seem  to  point  to  a  first  order  transition.  Since  both  these 
classes  of  experiments  are  open  to  their  own  objections,  many  have  sought  an  answer 
via  computer  simulation  to  which  we  now  turn  our  attention. 

4.2  Simulation  experiments 

Computer  experiments  on  phase  transitions  have  a  long  history  going  back  to  the 
pioneering  work  of  Alder  and  Wain wright  (1962)  who  studied  hard  disc  systems.  The 
experiments  we  review  use  either  the  Monte-Carlo  approach  (Metropolis  et  al  1953)  or 
the  molecular  dynamics  approach  (Rahman  1964).  A  large  number  of  papers  have  been 
published,  of  which  table  1  offers  a  partial  summary.  Given  this  large  volume  of  work, 
we  can  indicate  only  the  highlights. 

4.2a  Advantages— An  important  advantage  of  the  simulation  experiment  is  that  it 
enables  one  to  follow  events  at  the  level  of  individual  atoms.  To  appreciate  this,  we  refer 
to  figure  15.  Here  15a  shows  an  instantaneous  snapshot  of  the  system.  The  coordi¬ 
nation  number  of  each  atom  in  this  assembly  can  now  be  analyzed  via  a 
Dirichlet/Wigner-Seitz  construction  (see  figure  6),  and  one  can  identify  those  with 
anomalous  coordination  i.e.  those  not  having  the  normal  quota  of  six  neighbours.  One 
can  now  suppress  the  “normal”  atoms  and  project  only  the  defect  structure  as  done  in 
figure  15b.  Such  studies  have  not  only  shed  much  light  on  how  defects  evolve  and 
multiply,  but  more  significant,  have  called  attention  to  the  possible  role  of  grain 
boundaries  in  the  melting  process. 

A  second  useful  pointer  to  emerge  is  the  superiority  of  the  constant  pressure  and 
isothermal  scans  as  opposed  to  traversal  along  an  isochore  (constant  density  scan).  In 
the  former,  the  distinction  between  a  first  order  and  a  continuous  transition  is  more 
clear-cut  (as  we  shall  presently  see).  In  an  isochore  traversal  on  the  other  hand,  one 
could  strike  a  coexistence  patch,  which,  besides  making  the  transition  appear 
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(a) 


( b) 


Figure  15.  (a)  shows  a  typical  configuration  of  atoms  during  the  simulation.  +  and  — 

denote  respectively  seven-  and  five-fold  coordinations.  In  (b)  is  projected  the  defect  structure 
alone  (After  McTague  et  al  1980a). 
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Figure  16.  Equilibrium  density  and  enthalpy  per  atom  as  a  function  of  temperature  for  a 
256  u  atom  system  at  P*  =  0  01.  The  sharp  breaks  are  characteristic  of  first-order  transition 
(After  Abraham  1980). 
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continuous,  could  also  give  erroneous  indications  about  the  presence  of  a  hexatic  phase. 

Yet  another  merit  of  simulation  is  the  ability  to  follow  the  trajectories  of  individual 
atoms  over  a  fairly  large  number  of  steps.  We  shall  shortly  consider  examples  of  such 
plots  and  their  utility. 

4.2b  Some  results— In  a  typical  simulation  experiment,  one  considers  an  assembly  of 
particles  interacting  via  a  suitable  potential.  The  particles  are  confined  to  a  planar  cell 
and  suitable  boundary  conditions  are  imposed  (usually  periodic  boundary  conditions; 
exceptions  however  have  been  made:  Hansen  et  al  1979;  Abraham  1981b).  The 
thermodynamic  conditions  like  pressure,  temperature  etc  are  specified  and  the 
simulation  is  executed  in  a  series  of  consecutive  steps.  Initially,  a  large  number  of  steps 
must  be  gone  through  (with  suitable  “steering”)  so  that  equilibrium  is  attained 
corresponding  to  the  desired  conditions.  Near  a  transition,  special  care  is  necessary 
since  equilibration  will  require  many  more  steps  than  usual.  After  attainment  of 
equilibrium,  the  system  is  run  through  more  steps,  during  which  “measurements”  are 
made.  In  a  constant  pressure  run,  for  example,  one  would  monitor  the  density  and 
enthalpy.  Besides  evaluating  thermodynamic  quantities,  correlation  functions  are  also 
sometimes  computed.  In  addition,  defect  patterns,  trajectory  plots  etc  are  obtained  as 
required.  This  sequence  is  then  repeated  for  a  new  set  of  initial  conditions  and  in  this 
way  a  whole  scan  is  made. 

We  consider  now  some  representative  results.  Figure  16  shows  the  equilibrium 
density  and  enthalpy  per  atom  as  a  function  of  temperature  for  a  256  Lennard- Jones 
( V(r)  =  4 e  {(<r/r)12  —  (<j/r6)})  atom  system  at  a  fixed  pressure.  Abraham  (1980)  who  did 
this  work,  carried  out  such  studies  at  three  reduced  pressures  P*  —  Po2/e  —  0*01,  0-05 
and  1.  The  first  two  pressures  were  intentionally  chosen  low  since  nh  had  speculated 
that  the  melting  could  be  first  order  at  high  pressures  but  continuous  at  low  pressures 
(see  also  figure  9).  Abraham  however  finds  that  melting  is  always  a  discontinuous 
process  as  in  figure  16. 

Figure  17  shows  results  for  isothermal  scan  obtained  by  Toxvaerd  (1981)  for  two 
Lennard-Jones  systems  with  256  and  3600  particles  respectively.  Both  exhibit  Van  der 
Waal  loops  characteristic  of  a  first  order  transition,  the  loop  being  smaller  for  the  larger 
system.  Whether  the  transition  will  continue  to  be  first  order  in  the  thermodynamic 
limit  is  difficult  to  say  but  after  investigating  the  size  dependence  from  256  to  3600 
particles,  Toxvaerd  seems  to  feel  that  the  transition  will  continue  to  remain  weakly  first 
order.  Broughton  et  al  (1982)  investigated  a  system  of  780  particles  interacting  via  a 


Figure  17.  Pressure  versus  density  at  constant  temperature  for  two  lj  systems.  The  solid 
circles  are  for  3600  particles  and  the  open  circles  are  for  256  particles  (After  Toxvaerd  1981). 
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purely  repulsive  r~12  potential.  They  too  find  a  weak,  first-order  transition.  These 
authors  also  studied  a  system  with  a  single  vacancy  which  gave  them  the  same  loop 
during  both  forward  and  reverse  scans,  unlike  the  perfect  system.  Both  Toxvaerd  and 
Broughton  et  al  have  detected  grain  boundaries  from  defect  structure  analysis  (recall 
figure  15).  They  also  cautiously  note  that  periodic  boundary  conditions  could  stabilize 
the  perfect  solid,  inhibiting  the  formation  of  vacancies  and  interstitials.  In  turn  this 
could  inhibit  dislocation  climb  and  modify  the  character  of  the  transition.  However, 
taking  all  factors  into  account,  the  overall  assessment  of  both  authors  is  that  the  melting 
transition  is  not  only  weakly  first  order  but  just  preempts  the  kt  transition. 

Special  mention  must  be  made  of  the  work  of  Bakker  et  al  (1982,  1984)  who  have 
built  a  dedicated  hardware  processor  enabling  them  to  make  a  quantum  jump.  Instead 
of  dealing  with  512  or  1024  particles  as  usual,  they  are  able  to  study  the  molecular 
dynamics  of  10864  particles.  Their  device  had  the  speed  of  a  super  computer,  achieving 
in  750  hours  what  would  have  taken  4000  hours  in  Amdahl  470  V /7-B.  One  expects  this 
approach  will  set  a  new  trend.  Incidentally,  Bakker  et  al  (1984)  have  studied  a  Lennard- 
Jones  system  along  the  isochore  p*  —  po2  —  0-88,  very  close  to  that  traversed  earlier  by 
Frenkel  and  McTague  (1979)  and  Toxvaerd  (1980).  Whereas  Frenkel  and  McTague 
interpreted  their  data  as  being  indicative  of  two  continuous  transitions  with  an 
intermediate  hexatic  phase,  Bakker  et  al ,  in  agreement  with  Toxvaerd,  conclude  that 
melting  is  in  fact  first  order,  and  that  the  intermediate  phase  is  really  a  coexistence 
region. 

This  is  a  convenient  juncture  to  introduce  trajectory  plots,  a  good  example  of  which 
is  given  in  figure  18.  The  plots  here  pertain  to  a  256  atom  Lennard-Jones  system,  and 
were  obtained  during  a  scan  along  an  isochore  (Abraham  1981a).  They  show  four 
consecutive  sequences,  each  generated  from  8  x  105  Monte-Carlo  steps.  In  each  plot 
one  can  see  a  crystalline  as  well  as  a  liquid-like  region,  indicative  of  coexistence.  Further, 
the  melted  region  appears  to  be  mobile  with  passage  of  time,  suggestive  of  a  small 
interfacial  energy  between  the  liquid  and  the  solid.  Such  a  coexistence  could  well  mimic 
a  hexatic  phase,  even  if  the  phase  were  absent. 


Figure  18.  A  consecutive  sequence  for  four  trajectory  plots  for  a  256  u  atom  system.  Each 
plot  is  generated  from  8  x  105  consecutive  Monte-Carlo  steps,  with  the  system  held  at  fixed 
density  and  temperature.  Observe  the  coexistence  of  solid  and  liquid  regions  (After  Abraham 
1981a). 
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The  simulations  discussed  thus  far  deal  with  systems  having  no  physical  boundaries. 
Abraham  (1981a,  b)  considered  the  role  of  surfaces  since  there  is  some  evidence  for 
surface  premelting  in  the  case  of  3D  crystals  (Broughton  and  Woodcock  1 978).  He  finds 
(see  also  figure  19)  that  a  2D  solid  with  two  surfaces  is  a  stable  crystal  with  premelted 
surface  at  reduced  temperature  T*  =  0*40  (T*  =  kfT /e)  and  a  stable  liquid  at  T* 
=  0*42.  An  independent  free  energy  analysis  of  the  2D  Lennard-Jones  system  yields  a 
thermodynamic  melting  temperature  of  T*  =  0*415  (Barker  et  al  1981).  Abraham 
further  finds  that  a  solid  subjected  to  periodic  boundary  conditions  melts  at  T*  ~  0*45 
with  a  discontinuous  jump  in  density  (i.e.  via  first  order).  From  this  Abraham  concludes 
that  a  solid  subjected  to  periodic  boundary  conditions  superheats  well  beyond  the 
thermodynamic  melting  temperature.  As  he  remarks  (Abraham  1981a)  the  melting 
temperature  of  the  surfaceless  solid  “corresponds  to  an  upper  limit  for  the  stability  of 
the  metastable  2D  solid”.  Associated  with  the  transition  at  T*  =  0*45,  there  is  the 
stiffness  dependence  shown  in  figure  20.  At  the  transition  temperature,  the  measured 
value  of  K  is  lower  than  \6n  “because  of  defect  formation  in  the  solid  constrained  to 
remain  at  solid  density”.  However,  if  K  values  obtained  forT*  <  0*45  are  extrapolated, 
they  yield  the  solid  curve  shown  in  figure  20.  K  thus  appears  to  jump  discontinuously  to 
zero  at  the  transition  from  the  universal  value  of  16tl  This  of  course  emerges  from  the 
renormalization  group  analysis  of  kt.  Why  it  should  do  so  likewise  in  a  first-order 
transition  is  not  clear. 

We  have  already  alluded  to  some  of  the  doubts  that  can  be  cast  on  the  experiments  on 
adsorbed  films.  One  of  these  relates  to  the  possible  role  of  the  second  layer  (which  is 
frequently  present).  To  make  an  assessment,  Abraham  (1983;  1984;  see  also  Koch  and 
Abraham  1983)  simulated  xenon  films  on  graphite,  the  geometry  of  the  computational 
box  being  as  in  figure  21.  The  simulations  covered  also  the  conditions  of  the 
experiments  of  Heiney  et  al  (1982)  and  Rosenbaum  et  al  (1983).  Abraham’s  main 
conclusion  is  that  the  transition  still  remains  first  order,  at  variance  with  the  findings  of 
the  x-ray  experiments.  Imperfections  in  the  laboratory  graphite  substrate  could  be  a 
possible  source  of  the  discrepancy. 


Figure  19.  Trajectory  plots  of  a  2D  512  u  atom  system  with  two  free  surfaces.  The  two 
temperatures  investigated  are  on  either  side  of  the  thermodynamic  melting  point  of  0-415. 
(After  Abraham  1981a). 
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Figure  20.  Parameter  K  as  a  function  of  temperature  for  a  529  u  atom  system 
corresponding  to  P*  =  0  05  (After  Abraham  1981a). 
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Figure  21.  Computational  scheme  for  the  simulation  of  the  freezing  of  xenon  on  graphite. 
The  graphite  surface  defines  the  x,  y  plane  at  z  =  0.  Shown  are  two  layers  of  adsorbed  atoms, 
the  four  faces  across  which  periodic  boundary  conditions  are  applied  and  a  reflecting  wall  at 
the  top  (After  Koch  and  Abraham  1983). 


4.2c  Some  doubts — Though  elegant,  simulation  also  is  not  free  from  objections, 
principal  among  which  are  the  following  (Broughton  et  al  1982): 

(i)  Typical  sample  sizes  of  500-1000  atoms  are  too  small  to  be  representative  of 
thermodynamic  specimens. 

(ii)  Even  the  largest  computer  simulation  run  corresponds  to  a  very  short  time  on  a  lab 
scale.  Achievement  of  equilibrium  near  a  transition  is  therefore  a  sensitive  question, 
especially  if  critical  slowing  down  is  a  possibility. 

(iii)  Periodic  boundary  conditions  could  modify  the  apparent  nature  of  the  transition. 
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(iv)  Simulation  without  an  adequate  number  of  built-in  vacancies  and  interstitials 
might  not  produce  a  sufficient  amount  of  dislocation  climb  which  is  necessary  for  a 
proper  test  of  the  kt  theory. 

There  are  a  few  other  supplementary  queries  that  may  be  raised.  It  is  fair  to  say  that 
those  engaged  in  simulation  have  given  serious  consideration  to  all  these  objections  in 
carrying  out  their  experiments  and  in  analyzing  the  results.  Some  observers  are  however 
not  still  convinced. 

4.2 d  Simulation  using  defect  Hamiltonian — We  turn  now  to  an  entirely  different  kind  of 
simulation  due  to  Saito  (1982a,  b)  who,  instead  of  working  with  atoms,  used  a  defect 
Hamiltonian  defined  on  a  triangular  mesh  with  periodic  boundary  conditions.  In  the 
thermodynamic  limit,  the  defect  Hamiltonian  goes  over  to  that  of  kt  i.e  of  continuum 
elasticity  theory  (as  it  should).  Saito  uses  the  Monte-Carlo  method.  A  pair  of  nearest 
neighbour  sites  are  chosen  at  random  from  the  mesh.  If  both  sites  are  empty,  one  tries  to 
create  a  pair  of  dislocations  with  opposing  Burgers  vectors.  If  both  sites  are  occupied 
with  dislocations  of  opposite  Burgers  vectors,  an  annihilation  is  attempted;  otherwise 
an  exchange  is  tried.  The  trial  is  accepted  according  to  the  usual  Boltzmann  weight  etc. 

Saito  finds  that  if  Ec  the  dislocation  core  energy  is  large,  the  melting  transition  is  due 
to  dislocation  unbinding  and  is  in  accord  with  the  kt  theory.  For  a  small  core  energy, 
however,  the  melting  is  of  first  order,  being  caused  by  the  nucleation  of  grain  boundary 
loops.  It  is  natural  to  speculate  whether  a  system  of  atoms  will  show  similar  dichotomy 
depending  on  whether  the  inter-atomic  potential  favours  a  large  Ec  or  not.  In 
collaboration  with  V.  Sridhar  and  B.  Chakraborty  of  our  Centre,  we  are  currently 
examining  this  question  via  simulation.  Meanwhile,  we  learn  that  Van  Himbergen 
(1984)  has  carried  out  a  similar  study  for  the  ATTmodel  and  finds  both  continuous  as 
first  order  transition,  depending  on  the  shape  of  the  nearest  neighbour  interaction.  One 
therefore  awaits  Sridhar’s  results  with  interest. 


5.  Other  mechanisms  and  viewpoints 


In  view  of  the  doubts  concerning  the  validity  of  the  kt  hypothesis,  several  alternate 
viewpoints  have  recently  been  proposed.  Guided  by  the  results  of  several  simulation 
studies  and  the  work  of  Fisher  et  al  (1979)  who  noted  that  grain  boundaries  could 
appear  before  Tm  is  reached,  Chui  (1982, 1983)  constructed  a  grain  boundary  theory  of 
melting.  Chui  models  the  grain  boundary  as  an  array  of  dislocations.  A  pair  of 
dislocations  with  Burger’s  vectors  b,  b'  interact  with  each  other  via  a  potential  of  the 
form 


V  = 


K 

4n 


b  b  In 


[b  (r  — r')]  [b'-(r  — r')]\ 
|r  — rf  )' 


(35) 


where  K  is  as  in  (17).  The  first  term  in  (35)  should  be  familiar  to  us  from  the 
corresponding  term  of  (14).  The  second  term  of  (35)  represents  the  anisotropic  part  of 
the  interaction.  (The  continuum  Hamiltonian  of  nh,  for  example,  will  have  a  third  term 
related  to  the  core  energy  Ec.  That,  however,  is  not  pertinent  here.)  A  grain  boundary 
configuration  appropriate  to  a  triangular  lattice  was  next  considered,  and  the  effective 
potential  between  parallel  grain  boundaries  at  a  distance  Z  apart  was  computed  using 
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(35).  In  turn  this  formed  the  input  for  a  free  energy  analysis  of  phase  stability.  The 
conclusions  drawn  by  Chui  are: 

(i)  grain  boundaries  do  make  an  appearance  before  Tm , 

(ii)  a  strongly  first-order  transition  results  if  the  grain  boundaries  are  coupled  to 
density  change  or  bound  dislocations  pairs, 

(iii)  the  transition  becomes  weakly  first  order  for  a  core  energy  <  2-84Tm. 

In  short,  grain  boundary  melting  preempts  the  kt  transition  in  Chui’s  theory. 

Ramakrishnan  (1982,  1984)  following  his  earlier  work  with  Yussouff  (1979), 
approaches  the  transition  from  the  liquid  side  i.e  as  a  freezing  problem.  According  to 
this  view,  the  solid  emerges  from  the  liquid  primarily  due  to  the  dominance  of  a  density 
wave  of  crystalline  periodicity  and  symmetry.  The  wavevector  G  of  this  wave  is 
identified  with  the  first  peak  in  S(q)  the  liquid  structure  factor.  From  the  quantitiative 
point  of  view,  the  theory  considers  the  solid  near  melting  as  a  calculable  perturbation 
on  the  liquid  and  shows  that  their  free-energy  difference  is  determined  by  two 
parameters  g(G)  and  c(G).  Of  these  g(G)  is  defined  by 

p(r)  =  PoU  +>7  +  m(G)  expiGY), 

where  p(r)  is  the  density.  c(G)  is  the  Fourier  transform  of  the  fluid  correlation  function 
and  is  related  to  the  structure  factor  as 

S(G)  =  (1— c(G))-1. 

A  self-consistent  relation  for  p(r)  is  derived  involving  also  the  potential  V(r),  from 
which  the  free  energy  difference  is  computed.  The  theory  predicts  a  first-order 
transition  at  a  temperature  7}. 

Ramakrishnan  (1982)  also  considers  the  implication  of  7}  being  greater  or  less  than 
the  kt  transition  temperature  Tm.  Of  course  if  7}  <  Tm ,  then  the  kt  mechanism  has  no 
chance.  On  the  other  hand  if  Tm  <7},  it  could  well  happen  that  owing  to  lack  of 
sufficient  run  time,  computer  samples  continue  to  appear  as  a  solid  and  melt  only  when 
7}  is  attained  ( via  first  order  of  course).  Ramakrishnan  therefore  advocates  suitable 
experiments  on  spatial  correlation  to  ascertain  whether  7}  <  Tm  or  7}  >  Tm  in  systems 
where  7}  is  clearly  identified. 

In  a  series  of  high-powered  (!)  papers,  Kleinert  (1983a,  b  and  references  therein)  has 
reexamined  the  kt  mechanism,  with  particular  reference  to  the  role  of  the  core  energy 
term  in  the  Hamiltonian  for  an  assembly  of  defects.  In  the  kt  theory,  the  core  energy 
term  has  a  form  ~  Ec  Z(b  f)  with  the  further  stipulation  £  b,  =  0.  In  the  Coulomb 

i  i 

model  mentioned  in  §3.4,  the  corresponding  expressions  would  b e^qf  with  =  0 

i 

(charge  neutrality). 

According  to  Kleinert,  a  single  disclination  is  really  a  point  singularity  at  the  end  of  a 
string  of  dislocations,  rather  like  the  “Dirac  string”  of  the  magnetic  monopole. 
Kleinert’s  view  is  that  a  core  energy  term  of  the  kt  type  is  inappropriate  since  it  implies 
an  infinite  energy  for  the  dislocation  string,  automatically  ruling  out  dislocation  pile  up 
and  a  concomitant  first-order  transition.  By  following  a  route  earlier  established  by 
Villain  (1975)  vis-a-vis  the  2D  If  model,  Kleinert  derives  an  expression  for  the 
partition  function  which,  unlike  that  of  kt,  is  free  from  this  “objection”.  An  interesting 
feature  of  Kleinert’s  work  is  the  use  of  duality  concept,  already  familiar  in  lattice  gauge 
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theory  (Kogut  1979),  spin  glass  (Fradkin  et  al  1978)  etc.* *.  The  net  outcome  of  Kleinert’s 
analysis  is  that  melting  in  two  dimensions  is  a  first-order  process.  Very  recently,  Janke 
and  Kleinert  (1984)  have  backed  up  this  conclusion  with  a  Monte-Carlo  simulation. 

6.  Summary  and  concluding  remarks 

Despite  the  early  start  given  by  Peierls  (1935)  and  Landau  (1937)  on  the  theoretical  side 
and  the  pioneering  experimental  work  of  Langmuir  (1938)  on  surfaces,  2D  solids  did 
not  receive  much  attention  for  a  long  time,  possibly  because  the  departure  from  true 
crystallinity  was  almost  notional*.  However,  the  deep  enquiry  in  the  sixties  concerning 
the  nature  of  ordering  in  2D  systems  revived  interest,  leading  to  the  novel  proposal  by 
Kosterlitz  and  Thouless  (1973)  that  phase  transitions  are  possible  without  being 
accompanied  by  ordering  in  the  usual  sense. 

Several  systems  (superfluid  films,  electrons  on  liquid  helium  etc)  have  been 
experimented  upon  to  test  the  validity  of  the  kt  idea  (see  for  example,  Sinha  1980).  As 
far  as  melting  in  2D  is  concerned,  the  important  question  is  whether  it  is  prompted  by 
long  wavelength  or  short  wavelength  fluctuations.  If  it  is  the  former,  then  the  transition 
is  not  only  continuous  but  has  several  unusual  features.  A  crucial  factor  is  the  role 
played  by  topological  defects,  in  particular  the  dissociation  of  bound  pairs.  On  the 
other  hand  if  melting  occurs  mainly  due  to  a  pile-up  of  disruption  of  local  arrangements 
ie.  due  to  short  wavelength  fluctuations  (which  is  facilitated  by  core  overlap — see 
remarks  in  §3.3),  then  the  transition  is  of  first  order  as  in  3D. 

We  have  presented  a  sampling  of  the  more  detailed  (lab)  experiments  on  melting 
carried  out  so  far.  Most  of  these  have  been  on  adsorbed  films  of  rare  gas  atoms,  and  the 
results  seem  to  favour  the  kthny  theory.  The  results  on  self-supporting  films  of  liquid 
crystals  seem  however  to  favour  a  first-order  transition. 

In  contrast  to  the  lab  experiments,  the  computer  experiments  do  not  have  problems 
of  specimen  preparation  and  therefore  have  been  quite  popular.  The  evidence  from 
such  simulations  appears  to  be  in  favour  of  a  first-order  transition.  However,  as  in  the 
case  of  the  lab  experiments,  the  results  are  not  totally  objection-free.1  For  some,  no 
matter  how  detailed  the  simulation,  these  experiments  come  nowhere  near  to  real  life  on 
account  of  the  various  factors  discussed  earlier.  On  the  other  hand,  the  practitioners  of 
simulation  by  and  large  seem  convinced  that  the  transition  is  of  first  order,  although 
some  are  more  cautious  than  others  in  articulating  their  opinions.  In  passing,  it  is 
interesting  to  note  that  Abraham  (1984)  in  his  simulation  studies  is  able  to  reinterpret 
the  x-ray  results  of  Heiney  et  al  (1983)  and  of  Rosenbaum  et  al  (1983)  in  support  of  a 
first  order  transition,  although  the  experimenters  themselves  feel  their  results  point  to  a 
continuous  transition! 

At  the  moment  there  is  lull  of  sorts,  with  each  “camp”  holding  on  to  its  own  view 


*  See  also  Jose  et  al  (1977)  and  Nelson  (1978). 

*  Abraham  (1980),  for  example,  estimates  that  for  a  correlation  loss  ~  6  A,  the  2D  crystal  must  have  a  size 
~  1010  light  years!  It  is  remarkable,  however,  that  diffraction  experiments  reveal  via  line  shapes  even  these 
minute  differences. 

1  Within  the  simulation  camp  itself  there  has  been  some  cross  fire!  See  Toxvaerd  (1984)  and  Abraham  and 
Koch  (1984). 
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point  *  Resumption  of  vigorous  activity  is,  however,  very  much  on  the  cards  especially 
on  the  lab  experiments  side,  with  accent  on  eliminating  the  various  objections  raised  in 
the  past.  Besides  the  ever-popular  diffraction  studies,  other  supporting  investigations 
may  also  be  anticipated,  e.g.  on  shear  properties.  Hopefully  these  will  settle  the  issue 
once  and  for  all  in  an  unambiguous  manner.  Meanwhile  one  should  not  lose  sight  of  the 
finding  of  Saito  (1982a,  b)  that  one  can  get  either  a  first  order  or  a  continuous  transition, 
depending  on  the  value  of  Ec  the  core  energy.  There  is  also  the  related  work  on  the  XY 
model  (Van  Himbergen  1984)  where  this  dichotomy  has  been  analyzed.  It  could  well 
happen  that  there  are  some  2D  lattices  which  melt  via  a  kt  mechanism  and  others  which 
do  not! 

It  would  indeed  be  a  pity  if  in  the  final  analysis  it  is  established  that  2D  melting  does 
not  occur  by  a  kt  process.  If  so,  it  would  be  a  narrow  miss,  for  the  transition  appears  to 
be  on  the  edge  of  being  a  kt  type,  as  indicated  by  many  of  the  experiments  done  so  far. 
In  a  way  this  underscores  how  sensitive  the  outcome  of  a  given  experiment  is  to 
“disturbances”  (e.g.  imperfections  in  the  substrate). 

Even  if  it  turns  out  that  2D  melting  is  first  order,  the  kt  paper  would  not  be  without 
significance,  having  exerted  a  strong  influence  on  the  study  of  phase  transitions  in  2D. 
Of  special  interest  is  the  influence  exerted  “across  the  border”  on  particle  physics.  This  is 
not  altogether  surprising,  given  the  parallels  between  statistical  mechanics  and  field 
theory  (Kogut  1979),  and  particle  theorists’  interest  in  topological  objects  (Coleman 
1977  and  Rajaraman  1982).  Thus,  whatever  be  the  final  outcome  as  regards  melting,  the 
kthny  and  all  related  papers  have  been  beneficial  in  their  own  way  to  the  progress  of 
physics,  having  made  a  beginning  concerning  the  statistical  mechanics  of  topological 
defects. 


7.  Other  reviews 

In  view  of  its  great  popularity,  there  naturally  have  been  several  reviews  on  the  kt 
transition  from  time  to  time.  It  is  pertinent  therefore  to  call  attention  to  these,  and  place 
our  own  in  the  proper  perspective. 

In  one  of  the  early  reviews,  Kosterlitz  and  Thouless  (1978)  emphasize  superfluidity, 
understandable  in  view  of  the  forum.  A  subsequent  survey  entitled  “ordering  in  two 
dimensions”  by  Kosterlitz  (1980)  is  broadbased  but  the  accent  is  almost  entirely  on 
theory.  Halperin  has  reviewed  the  subject  in  his  Kyoto  lectures  (Halperin  1979)  and  in 
the  Les  Housches  School  (Halperin  1981).  The  discussions  are  extensive  and  again  the 
emphasis  in  on  theory.  Barber  (1980)  does  not  restrict  himself  to  the  kt  problem  but 
covers  instead  the  whole  gamut  of  O (n)  models  in  2D.  In  the  process,  he  indicated 
linkages  to  the  problem  of  quark  confinement.  Nelson  (1979)  has  several  useful  and 
interesting  comments  on  the  XY  model  and  its  various  gauge  generalizations.  Young 


*  For  example.  Nelson  (1983b)  remarks  “I  doubt  that  the  simulation  techniques  used  by  Abraham  et  al 
could  reproduce  the  apparently  continuous  melting  transition  of  incommensurate  xenon  on  graphite 
observed  recently  via  precision  x-ray  diffraction  by  Heiney  et  al”.  Abraham  (1984)  on  the  other  hand  holds, 
“From  our  present  study  the  status  is  clear  for  the  high-temperature  xenon  film  on  graphite  ....  We  observe 
first-order  melting  of  xenon  .  .  .  ”. 
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(1980)  gives  a  simple  and  crisp  introduction  to  the  central  idea  of  the  kt  theory. 
Abraham  (1981a)  has  a  very  readable  article  on  simulation  experiments  but  it  is  a 
paraphrase  of  his  own  extensive  work.  Brinkman  et  al  (1982)  offer  a  brief  but  popular 
survey  of  the  2D  melting  question.  Mention  should  also  be  made  of  the  conference 
volume  edited  by  Sinha  (1980)  which  gives  a  good  overview  of  the  then  state  of  the  art 
relating  to  phase  transitions  in  2D.  Our  paper  restricts  to  the  melting  problem  alone, 
but  goes  into  some  detail  both  on  the  conceptual  as  well  as  on  the  experimental  side. 
The  discussion  of  the  experimental  situation  is  reasonably  uptodate.  The  present  review 
should  therefore  be  a  useful  complement  to  the  various  earlier  surveys. 
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Note  added  in  proof 

After  this  paper  was  sent  for  publication,  we  became  aware  of: 

(i)  A  review  by  D  R  Nelson  on  Two  Dimensional  Superfluidity  and  Melting  (in  Fundamental  Problems  in 
Statistical  Mechanics  V  1980  ed  E  G  D  Cohen  Amsterdam:  North  Holland). 

(ii)  a  paper  by  A  Holz  1984  (Phys.  Lett.  105A  472)  who  comments  on  Kleinert’s  work,  and 

(iii)  a  paper  by  S  B  Hurlbut  and  J  G  Dash  1 984  (Phys.  Rev.  Lett.  53, 1931)  who  report  a  first-order  melting  for 
4He  monolayer  on  graphite. 

Attention  is  also  invited  to  the  Conference  Proceedings  entitled  Melting ,  Localization  and  Chaos  (editors  R 
Kalia  and  P  Vashista  1982  Amsterdam:  North  Holland)  which  contains  many  papers  relevant  to  the  present 
article. 
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Defect  structure  studies  using  positron  annihilation  spectroscopy 
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Abstract,  The  positron  annihilation  method  is  a  new  addition  to  the  range  of  sensitive 
complementary  nuclear  techniques  available  for  materials’  research.  The  preferential  sensi¬ 
tivity  of  positrons  towards  micro-defect  domains  which  are  not  assessable  by  other  techniques 
makes  it  an  attractive  tool  for  many  materials  science  problems.  The  present  paper  is  intended 
as  a  brief  introduction  on  the  principle  of  measurements  and  its  potential  is  exemplified  with 
the  help  of  results  on  some  metallic  and  ceramic  systems. 

Keywords.  Defect  structure  studies;  positron  annihilation  spectroscopy;  sensitive  comp¬ 
lementary  nuclear  techniques. 

PACS  No.  61  70;  78  70 


1.  Introduction 

The  existence  of  the  positron,  the  antiparticle  of  the  electron  was  predicted  by  Dirac 
(1930)  and  experimentally  verified  by  Anderson  (1932).  An  electron-positron  pair  is 
unstable  and  the  annihilation  photons  constitute  the  signal  through  which  the 
information  on  the  behaviour  of  positrons  in  condensed  matter;  including  their 
interaction  with  imperfections,  is  obtained. 

Positron  annihilation  spectroscopy  has  many  advantages  in  the  study  of  condensed 
matter.  It  is  a  non-destructive  method.  It  is  now  well  established  as  a  technique  for  the 
study  and  characterisation  of  defect  microstructures  (Doyama  and  Hasiguti  1973; 
Seeger  1976;  Byrne  1980;  Wiffen  and  Spitznagel  1982). 

The  physical  basis  of  positron  interaction  in  condensed  matter  is  well  described  by 
various  authors  (West  1973;  Hautojarvi  1979;  Brandt  and  Dupasquier  1983).  A  brief 
account  of  the  positron  annihilation  mechanism  and  the  principle  of  measurement 
techniques  is  presented  here  along  with  some  applications  in  metals  and  ceramics. 


2.  Positron  dynamics  in  solids 

Positrons  are  usually  obtained  from  radio-nuclide  sources  such  as  22Na,  64Cu,  etc. 
which  emit  positrons  with  sufficient  energy  to  penetrate  most  materials  to  a  few 
hundred  microns.  On  implantation,  positrons  get  thermalised  rapidly  ( «  1  psec)  by  a 
succession  of  ionization  and  phonon  scattering  events.  The  thermalisation  times  are 
usually  much  shorter  as  compared  to  the  mean  lifetime  of  positrons  in  solids  (Kubica 
and  Stewart  1975). 
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MEDIUM  ANNIHILATION  LIFETIME  I 

MODE  (ns) 


Figure  1.  Positron  annihilation  modes  in  solids:  (A)  free  annihilation,  (B)  trapping  at  point 
defects  and  (C)  positronium  formation  and  subsequent  decay  by  (Cj)  para,  (C2)  ortho  and  (C3) 
pick-off  annihilation  processes. 

The  injected  positrons  are  distributed  in  the  medium  in  various  possible  states. 
Figure  1  summarises  schematically  the  experimentally  observable  alternative  annihi¬ 
lation  modes.  Path  A  represents  the  annihilation  of  positrons  with  free  electrons  into 
two  y-rays  of  51 1  keV  each,  in  approximately  0*  1  to  0-5  nsec  depending  upon  the  density 
of  the  free  electrons  in  the  solid.  Positron  wave  functions  can  be  localized  at  point 
defects  (path  B).  Since  the  electron  density  at  these  sites  is  lower  than  in  the  perfect 
crystal,  the  mean  lifetime  of  the  trapped  positrons  becomes  longer  (0*2-1  *0  nsec)  than 
that  of  the  free  positrons.  These  two  annihilation  modes  are  typical  for  metals. 

In  insulators,  positrons  can  capture  an  electron  (path  C)  to  form  positronium,  Ps,  the 
hydrogenic  two  body  e+  -e~  system  (be  =  6*8  eV), 

In  a  diamagnetic  medium,  the  positronium  distribution  is  initially  in  the  singlet 
(p-P s)  and  triplet  (o-Ps)  states  in  the  ratio  of  1 : 3.  The  intrinsic  lifetime  of  p-Ps  is 
0*125  nsec,  about  the  same  as  the  free  positron  lifetime  in  dense  metals,  whereas  for 
o-Ps  it  is  140  nsec.  However,  o-Ps  in  solids  decays  mainly  by  a  competing  mechanism 
called  the  ‘pick-off’  process,  where  the  positron  in  the  o-Ps  suffers  a  2y  annihilation  in 
collision  with  a  valence  electron  having  opposite  spin  (path  C2).  Therefore,  the  o-Ps 
lifetime  is  shortened  to  a  few  nano  seconds  (0*6  to  10*0  nsec),  which  is  again  a  function  of 
the  electron  density  in  the  medium. 

Both  the  positron  and  the  positronium  have  a  strong  tendency  to  get  localised  in  the 
regions  of  low  electron  density.  This  localization  is  reflected  in  the  subsequent 
annihilation  radiation  and  thus  provides  the  basis  for  using  positrons  as  a  probe  for  the 
study  of  defects  in  solids. 

3.  Principle  of  pas  measurements 

Positron,  either  free  or  bound,  interacts  with  more  than  one  electron  in  different  energy 
states  in  a  many  particle  system.  Basically  two  kinds  of  information  are  obtained:  (1)  that 
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related  to  the  electron  density  and  (2)  that  of  the  distribution  of  electrons  in  momentum 
space.  Thus,  three  different  types  of  measurements  can  be  performed:  (i)  positron 
lifetime  or  more  specifically,  the  positron  decay  spectrum;  (ii)  angular  correlation  of  the 
direction  of  the  photon  emitted  during  2y-annihilation;  (iii)  doppler  broadening  of 
511  keV  annihilation  radiation. 

The  time  delay  between  the  injection  of  positrons  (marked  by  the  1-28  MeV  nuclear 
y-ray  using  a  22Na  source)  and  the  detection  of  511  keV  annihilation  photon  in  the 
solid,  measured  over  a  large  number  («  105)  of  individual  events  constitutes  the 
lifetime  spectrum  (Smedskjaer  and  Fluss  1983).  From  such  time  distributions,  one  can 
deduce  the  decay  rate  of  positrons  from  experimentally  distinguishable  states  in  the 
material.  For  ionic  solids,  like  alkali  halides  and  metal  oxides,  generally  two  or  three 
lifetime  components  are  obtained.  The  fastest  component  zx  is  assigned  to  free  and 
p  Ps  decay,  while  the  longer  ones  x2  and  t3,  to  the  pick-off  annihilation  of  o-  Ps  in  the 
crystalline  and  disordered  regions  respectively  (Bertolaccini  et  al  1971;  Brandt  and 
Paulin  1968). 

If  the  centre  of  gravity  of  the  annihilating  electron-positron  pair  is  at  rest,  two  y-rays 
must  be  emitted  with  exactly  equal  energy  (511  keV)  and  in  opposite  directions. 
However,  because  of  the  non-zero  momentum  of  the  pair,  the  photons  deviate  from 
collinearity,  ie;  the  two  y-rays  are  emitted  in  directions  n±0  radians.  In  addition,  the 
energy  of  the  annihilating  photon  undergoes  a  Doppler  shift  (51 1  Hh  A E)  keV.  Since  the 
momentum  of  the  thermalised  positron  is  very  small  relative  to  that  of  the  electron,  the 
distribution  of  0  in  angular  correlation  and  A E  in  the  Doppler  broadening  experiments 
therefore  represent  essentially  those  of  the  crystal  electrons,  both  core  and  valence.  The 
higher  momentum  core  electrons  contribute  proportionally  more  to  the  largest  values 
of  0  and  A E.  Therefore,  the  y-y  angular  correlation  curves  and  Doppler  broadened 
spectra  are  both  more  sharply  peaked  for  solids  containing  defects  and  trap  positrons, 
since  there  are  fewer  core  electrons  at  the  defect  site  (Doyama  1979). 

For  quantitative  analysis  of  defects  in  solids,  parameters  deduced  from  pas 
measurements  are  employed  with  the  help  of  trapping  models  (Brandt  1967;  Bergersen 
&  Stott  1969). 


4.  pas  study  of  vacancy  concentrations  in  metals 

pas  has  a  distinct  advantage  in  the  study  of  vacancy  concentration  in  metals,  because  it  is 
possible  to  carry  out  these  studies  under  thermal  equilibrium  conditions  and  at  very  low 
vacancy  concentration  («  10-6)  where  only  monovacancies  dominate  the  whole 
equilibrium  ensemble. 

The  common  observation  in  metals  is  that  the  positron  annihilation  parameter 
shows  a  typical  S-shaped  curve:  a  thermal  expansion  effect  at  low  temperatures  which 
gets  superimposed  with  a  vacancy  trapping  effect  at  higher  temperatures  and  a 
saturation  near  the  melting  point,  where  it  is  assumed  that  ail  the  positrons  get 
annihilated  from  the  trapping  sites  (MacKenzie  et  al  1967;  McKee  et  al  1972; 
Triftshauser  1975;  Seegers  et  al  1978).  Figure  2  shows  the  typical  lineshape  parameter 
dependence  for  Cu  as  a  function  of  temperature  (Fluss  et  al  1980).  For  evaluating  £t), 
the  formation  energy  of  a  vacancy,  the  experimentally  measurable  quantities  Fb  and  Fv 
which  represent  the  specific  values  of  positron  annihilation  parameter,  F  (eg  lifetime. 
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Figure  2.  The  experimental  (□)  Doppler-broadening  lineshape  parameter  F(T)  as  a 
function  of  temperature  T,  and  the  monovacancy  two-state  trapping  model  fit  to  the  data  (solid 
curve).  After  Fluss  et  al  1980. 


ratio  of  the  peak  height  to  the  area  of  angular  correlation  curve,  line-shape  parameter 
etc.),  in  the  free  and  trapped  states  respectively,  are  used  in  an  Arrhenius  relationship. 


j~  +  In  (Axb) 


(1) 


where  k  is  the  Boltzmann  constant,  T is  the  absolute  temperature,  A  is  a  constant  and  zb 
is  the  lifetime  for  the  perfect  lattice.  The  values  of  Ev  determined  from  pas  technique  are 
found  to  be  in  good  agreement  with  those  obtained  from  other  experiments  (Siegel 
1978;  Doyama  1979;  West  1979). 

5.  Radiation  damage  studies 

Study  of  radiation  damage  is  important  from  the  point  of  view  of  understanding  the 
behaviour  of  point  defects  in  solids.  The  sensitivity  of  the  positron  annihilation 
spectroscopy  technique  to  vacancies  and  their  aggregates  has  made  it  particularly  useful 
in  elucidating  the  recovery  processes  in  metals  and  alloys  after  neutron  and  electron 
irradiation  (Mogensen  et  al  1972;  Cotterill  et  al  1972).  From  the  pas  data,  it  is  possible 
to  identify  the  migrating  entity  and  determine  the  average  dimensions  of  vacancy 
clusters  or  microvoids,  which  are  below  the  limit  of  conventional  electron  microscopy 
(Seeger  1973;  Petersen  et  al  1975;  Mantl  and  Triftshauser  1975). 

Irradiation  at  low  temperatures  predominantly  results  in  the  production  of  single 
vacancies  and  interstitials.  The  defect  recovery  in  copper  crystals  (Mantl  and 
Triftshauser  1975)  irradiated  by  3  MeV  electrons  at  10  K,  was  followed  by  residual 
resistivity  and  Doppler  broadening  parameters  expressed  in  terms  of  Iv  (corresponding 
to  valence  electrons)  and  Ic  (corresponding  to  core  electrons)  (figure  3).  The  initial  drop 
in  resistivity  indicates  that  the  number  of  trapping  sites  for  positrons  is  reduced.  Thus, 
the  increase  in  Iv  and  the  corresponding  decrease  in  Ic  can  be  interpreted  as  due  to  the 
aggregation  of  vacancies  into  small  three-dimensional  clusters,  which  eventually 
collapse  into  vacancy  loops,  often  observed  in  electron  microscopy  studies.  These 
results  thus  support  the  vacancy  model  of  stage  III  for  migration  of  point  defects 
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Figure  3.  Dependence  of  the  lineshape  parameters  lv  and  Ic,  and  the  electrical  resistivity  for 
electron  irradiated  copper,  as  a  function  of  the  annealing  temperature.  (After  Mantl  and 
Triftshauser  1975). 


(Schilling  and  Sonnenberg  1973).  This  was  further  verified  in  the  case  of  copper  from 
measurement  of  positron  lifetimes  (Hinode  et  al  1978). 

6.  Defects  in  non-metals 

In  non-metals,  the  positron  annihilation  modes  are  quite  complex — usually  charac¬ 
terised  by  two  or  more  decay  components — and  the  reason  for  this  has  still  not  been 
established.  However,  positron  annihilation  behaviour  in  molecular  solids  like 
polymers,  ionic  solids  like  alkali  halides,  oxides  and  the  silicate  glasses  is  explained, 
partly  because  of  the  observations  made  with  other  conventional  techniques  (Walker 
1979;  Dupasquier  1979).  As  positron  trapping  by  defects  is  the  basic  phenomenon,  the 
established  analytical  methods  are  equally  effective  in  analysing  the  pas  results. 

The  formation  of  a  Ps-like  bound  state  is  no  longer  speculative  and  several 
confirmatory  experiments,  involving  measurements  under  different  atmospheres 
(Steldt  and  Varlashkin  1972),  three-to-two  photon  relative  yield  measurements  (Sen 
and  Patro  1969),  magnetic  quenching  experiments  (Yam  et  al  1978;  Judd  et  al  1979)  etc., 
have  helped  to  reveal  the  annihilation  mechanism.  Figure  4  shows  the  y-y  angular 
correlation  results  on  pure  and  doped  polycrystalline  samples  of  Th02.  An  increase  in 
the  peak  counting  rate  in  the  presence  of  transition  metal  impurities  represents  the 
enhancement  in  the  2 y  annihilation  events  compared  to  that  in  pure  Th02.  This  is 
attributed  to  the  increase  in  the  pick-off  quenching  rate  of  o-Ps  in  the  presence  of  these 
ions. 

7.  Positron  annihilation  in  fine  powders 

Positronium  formation  in  metal  oxide  powders  was  first  reported  for  A1203,  MgO, 
CaO  and  ZnO  (Kusmiss  and  Stewart  1967).  The  angular  correlation  curves  showed  a 
low  intensity  narrow  component  attributed  to  the  annihilation  of  p-Ps,  formed  on  or 
near  the  surface  of  the  finely  divided  powder  particles.  Earlier  observations  of  long  time 
components  (r3  ~  3-5  ns,  /3  ~  6%)  observed  for  alkaline-earth  oxides  (Bussolati  and 
Zappa  1964)  and  the  additional  intermediate  component  (T2~05ns,  /2  ~  12%) 
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Figure  4.  Angular  distribution  of  annihilation  photons  in  Th02:  pure  (X)  and  doped  with 
Ti02  (A)  and  Fe203  (□). 


obtained  in  several  oxides  and  fluorides  (Paulin  and  Ambrosino  1967)  also  gave  a  clear 
evidence  for  the  decay  of  a  Ps-like  bound  state  in  the  medium. 

The  most  remarkable  feature  observed  is  that  the  annihilation  characteristics  are 
strongly  dependent  on  the  particle  size  of  the  powders.  For  Zr02  powders  (Mitsuhashi 
et  al  1972),  it  was  found  that  the  positron  life  time  is  insensitive  to  its  allotropic 
modification  but  is  dependent  only  on  the  particle  size.  Life  time  components  and  t2 
were  found  to  increase  as  the  particle  size  becomes  small. 

Brandt  and  Paulin  (1968)  demonstrated  a  strong  dependence  of  life  time  spectra  on 
particle  size  of  MgO,  A1203  and  Si02.  The  systematic  narrowing  of  the  angular 
correlation  curve  (Steldt  and  Varlashkin  1972)  and  the  increase  in  the  pick-off  life  time 
component  t2,  (Baranowski  et  al  1977)  with  decreasing  particle  size  of  MgO  powders 
are  also  consistent  with  the  results  of  previous  authors.  The  decrease  in  mean  life  time 
with  improvement  in  crystalline  perfection  is  a  general  phenomenon  observed  in  metal 
powders  (Noguchi  and  Miyata  1979),  graphite  (Iwata  et  al  1981),  diamond  (Mokrushin 
and  Breusov  1980),  etc. 

Figure  5  shows  the  life-time  results  on  microcrystalline  Th02  powders  (Upadhyaya 
et  al  1982a).  A  linear  plot  correlating  Tj  and  t2  with  crystallite  size  deduced  from  X-ray 
line  broadening,  established  that  the  finer  particles  correspond  to  a  higher  defect 
density.  The  annihilation  probability  for  both  free  (t^  and  o-Ps  pick-off  (t2)  processes 
is  reduced  in  the  pseudo-amorphous  powders  due  to  lower  than  average  electron 
density.  With  increasing  crystalline  perfection,  the  free  volume  at  the  trapping  site 
decreases  which  enhances  the  overlap  of  the  positron  wave  function  with  that  of  the 
valence  electrons  in  the  medium.  Deviation  in  the  value  of  the  positron  parameter  from 
that  of  the  single  crystal  state  is  thus  a  quantitative  measure  of  the  degree  of  disorder  in 
the  powdered  state.  This  leads  to  the  suggestion  that  a  reexamination  of  the  intrinsic 
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Figure  5.  Positron  lifetime  components  xy  and  t2  versus  primary  particle  size  (measured  by 
X-ray  diffraction  line  broadening  method)  correlation  for  finely  divided  thoria  powders. 


values  of  positron  life  times  reported  so  far  by  employing  powder  samples  may  be 
needed. 

Another  important  aspect  of  positron  dynamics  in  fine  powders  is  concerning  their 
diffusion  mechanism — both  from  the  point  of  view  of  moderation  as  well  as  trapping 
(Barko  et  al  1979;  Brandt  1974).  The  injected  positrons  after  thermalization  and  Ps 
formation  scan  the  powder  grains  isotropically.  The  rate  of  trapping  at  defects  and  the 
escape  rate  at  the  surface  are  proportional  to  the  diffusion  constant  of  the  migrating 
species.  Assuming  the  Ps  to  be  formed  with  equal  probability  inside  a  small  insulating 
solid  of  spherical  shape,  the  escape  rate  can  be  derived.  The  relative  intensity  of  the  life 
time  component  assigned  to  the  surface,  when  expressed  as  a  function  of  particle  size 
enables  one  to  calculate  the  Ps  diffusion  constant  (D+).  For  oxides,  it  was  found  to  be  of 
the  order  of  10” 5  cm2  sec” 1  (Brandt  and  Paulin  1968;  Upadhyaya  et  al  1982b),  which  is 
about  three  orders  of  magnitude  smaller  than  for  metals  (Paulin  et  al  1974;  Matsuoka 
et  al  1980;  Noguchi  and  Miyata  1979). 

Positron  diffusion  constant  in  powders  can  also  be  measured  using  the  following 
expression  (Brandt  1974) 

Y=(D+zb)l/2Sd  (2) 

where  Fis  a  comprehensive  life  time  parameter,  rb  the  intrinsic  value  of  life  time  in  the 
solid  and  the  quantity  Sd,  which  is  the  product  of  the  specific  surface  area  of  the  powder 
and  the  theoretical  density,  is  equivalent  to  the  inverse  of  the  mean  particle  diameter. 

The  values  of  the  escape  probability  Is  and  Y  for  thoria  powders  are  plotted  in  figure  6 
(Upadhyaya  et  al  1982c).  It  shows  an  increase  in  the  intensity  of  the  surface  life  time 
component  with  increasing  surface  area.  The  slope  of  the  Y  vs  Sd  graph  yields  the  Ps 
diffusion  constant  in  thoria  as  80  x  10” 5 cm2  sec”  \  equivalent  to  a  Ps  diffusion  length 
of  about  60  A. 
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Figure  6.  Sd  product  vs  (i)  escape  probability  (Is)  of  o-Ps  to  the  powder  particle  surfaces  and 
(ii)  and  Y-parameter  dependence  for  thoria  powders. 


Summary 

Positron  annihilation  spectroscopy  has  made  a  significant  impact  in  our  understanding 
of  the  defect  structure  in  solids.  The  favourable  combination  of  properties  possessed  by 
positrons  makes  them  specially  valuable  as  probes  of  microscopic  and  atomic  scale 
imperfections.  It  has  been  found  to  be  a  sensitive  tool  for  the  study  of  electron  structure 
in  metals  and  also  has  a  great  potential  in  the  study  of  point  defects — especially  lattice 
vacancies.  This  selective  nature  of  the  interaction  has  helped  in  explaining  various 
metallurgical  phenomena.  The  defect  recovery  processes  associated  with  the  irradiation 
damage  and  various  thermomechanical  treatments  in  metals  and  alloys  have  been 
investigated  to  great  advantage. 

pas  has  successfully  utilized  oxide  powder  systems,  primarily  as  a  suitable  medium  to 
test  and  rationalize  experimentally  the  predictions  of  quantum  electrodynamics  about 
the  positronium  formation  and  its  decay  reactions.  In  this  process,  it  has  contributed  to 
our  understanding  of  defect  structure  in  ceramics,  in  much  the  same  way  as  it  has  in 
metals.  Results  cited  here  elucidate  the  role  of  material  fineness  on  Ps-dynamics,  ie,  the 
diffusion  constant,  pick-off  annihilation  rate,  etc.,  employing  a  medium  which  has  been 
well  characterized  by  a  variety  of  conventional  techniques. 
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Abstract.  Development  of  alloys  for  reliable  performance  in  extreme  radiation  environ¬ 
ments  is  vital  for  the  viability  of  advanced  nuclear  reactor  systems.  Over  the  past  decade,  there 
has  been  a  considerable  growth  in  our  understanding  of  the  basic  processes  of  radiation 
damage,  the  nature  of  the  induced  defects,  their  interaction  and  migration,  and  the  influence  of 
these  on  the  mechanical  behaviour  of  metals.  This  understanding  has  however  come  mainly 
from  studies  in  pure  metals  and  dilute  alloys,  and  there  are  difficulties  when  applying  these 
concepts  to  concentrated  alloys,  particularly  of  technological  interest.  The  present  article, 
which  attempts  to  bridge  this  gap,  discusses  recent  research  developments  and  some  of  the 
emerging  new  concepts  as  applicable  to  alloy  systems.  Interstitialcy  transport;  percolation 
effects  in  defect  migration;  short  range  and  long  range  ordering  and  restructuring  of  alloys; 
defects  and  damage  behaviour  of  metallic  glasses;  synergetic  processes  and  phase  instabilities; 
and  finally,  swelling,  irradiation  creep  and  ductility  behaviour  of  alloy  systems  are  the  topics 
discussed. 

Keywords.  Alloy  systems;  interstitialcy  transport;  percolation  effects;  defect  migration;  short 
range  ordering;  long  range  ordering;  metallic  glasses;  swelling;  irradiation  creep;  ductility. 

PACS  No.  61  80 


1.  Introduction 

Growth  of  commercial  nuclear  reactors  and  development  of  advanced  reactor  systems 
have  led  to  a  demand  for  new  alloys  which  can  reliably  be  used  in  radiation 
environments.  Considerable  advance  has  been  made  in  our  understanding  of  radiation 
effects  in  materials  over  the  past  decade  for  safe  and  economic  operation  of  nuclear 
reactors.  For  the  pressurized  heavy  water  reactors  (phwrs)  for  example,  extensive 
investigations  have  been  necessary  to  understand  the  in-reactor  behaviour  of  zirconium 
alloys,  and  to  improve  the  performance  of  zirconium  alloy  fuel  tube  and  pressure  tubes 
through  appropriate  control  of  texture  and  microstructure  during  tube  fabrication 
(Sinha  and  Kakodkar  1983).  In  the  case  of  fast  breeder  reactors  (fbrs),  radiation  damage 
effects  in  structural  components  are  orders  of  magnitude  more  severe  because  of  the 
increased  damage  rates  due  to  fast  neutrons  and  embrittlement  problems  associated 
with  the  production  of  helium  by  (n,  a)  reactions.  With  the  alloy  development  and 
evaluation  efforts  over  the  last  decade  (see  the  reviews  by  Rodriguez  et  al  1984; 
Sundaram  1984)  alloys  like  modified  stainless  steels,  ferritic  steels  and  nimonic  alloys 
have  emerged  showing  reduced  swelling,  irradiation  creep  and  helium  embrittlement 
behaviour.  In  more  advanced  reactor  concepts  like  fusion  systems,  the  damage  to  the 
first  wall  by  direct  impingement  of  light  and  heavy  ions  can  erode  the  surface  besides 
affecting  the  mechanical  properties  (Gruen  1979).  Here,  along  with  the  development  of 
new  alloys  ( e.g .  V-15%  Cr-5%  Ti)  new  technologies,  such  as  rapid  solidification  (to 
produce  metallic  glasses),  formation  of  surface  coatings  (e.g.  TiC),  have  also  been 
investigated  to  ascertain  their  superior  radiation  damage  resistance. 
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In  all  these  developments  significant  contributions  have  come  from  the  more  basic 
aspects  of  radiation  damage  research.  There  has  been  a  tremendous  growth  in  our 
understanding  about  the  mechanism  of  damage  and  the  behaviour  of  vacancies;  self¬ 
interstitials;  their  configurations  and  clustering  kinetics;  precipitation;  diffusion  of 
gases  like  helium;  and  the  effect  of  stress  and  temperature  on  these  defects  (see  the 
review  of  Ullmaier  and  Schilling  1980).  This  has  emerged  mainly  from  studies  in  pure 
metals,  often  on  single  crystals  and  by  investigating  simple  defect  structures.  To  some 
extent  all  this  can  be  extended  to  dilute  alloys  but  there  are  serious  difficulties  while 
applying  these  principles  to  concentrated  alloys.  Here,  even  the  basic  building  blocks 
like  vacancies  and  self-interstitials  become  diffuse  and  their  structure  and  properties  are 
not  understood.  Even  well-established  facts  in  pure  metals  like  the  high  mobility  of  self¬ 
interstitials  cannot  be  assumed  since  there  is  gradually  emerging  evidence  that  in  some 
alloys  they  may  be  slower  than  vacancies  and  become  mobile  only  at  higher 
temperature.  Emphasis  now  in  basic  research  is  shifting  to  radiation  damage  problems 
in  concentrated  alloys.  This  shift  is  encouraging  since  it  brings  basic  research  into 
contact  with  materials  of  technological  importance  which  have  so  far  mainly  remained 
in  the  realm  of  metallurgical  research. 

Basic  research  efforts  at  understanding  the  radiation  behaviour  of  alloy  systems  are 
still  in  the  early  stages.  This  article  discusses  some  of  the  new  concepts  which  are 
emerging  in  this  field  and  the  differences  from  pure  metals  are  highlighted.  In  §2  the 
damage  mechanisms  are  discussed  together  with  defects  and  their  mobility  in  alloys. 
Section  3  deals  with  the  secondary  effects  of  damage;  segregation  and  dissolution, 
order-disorder  and  damage  effects  in  metallic  glasses.  The  highly  non-equilibrium 
nature  of  the  processes  involved  introduces  interesting  possibilities  of  synergetic  effects 
which  could  manifest  as  phase  instabilities.  These  are  briefly  discussed  in  §  4.  Finally,  §  5 
relates  these  structural  changes  in  alloys  with  their  metallurgical  properties  and 
behaviour.  Accelerators  are  increasingly  being  used  for  such  investigations  and  some  of 
the  recent  results  are  presented. 


2.  Displacement  damage  and  defects  in  alloys 

The  process  of  atomic  displacements  and  the  production  of  the  initial  damage  structure 
can  be  discussed  under  three  headings  (i)  primary  and  secondary  atomic  displacements, 
(ii)  defects  and  their  mobility  and  (iii)  mixing  effects  and  structural  changes. 

2.1  Displacement  process 

2Aa  Primary  displacements :  The  early  stages  of  damage  in  alloys  are  essentially 
similar  to  those  in  pure  metals.  They  involve  the  creation  of  the  primary-knock  on- 
atoms  (pkas)  by  the  incident  irradiation  and  secondary  damage  as  Frenkel  pairs  and 
cascades  produced  in  turn  by  the  pkas.  For  an  incident  monoenergetic  radiation 
(neutrons/ions/electrons)  with  energy  E ,  the  pkas  acquire  a  spectrum  of  energies  due  to 
recoils  of  energy  T.  This  energy  spectrum  is  governed  by  the  nuclear  scattering  and 
involves  the  nuclear  differential  cross-section  dop(E,T)  which  depends  on  the  masses 
and  nuclear  charges  (MnZ,e)  and  ( Mp,Zpe )  of  the  incident  (/)  and  target  atom 
(P  =  A,  B,  ...  ,  for  alloy  atoms)  respectively.  In  non-relativistic  situations  the  maxi¬ 
mum  recoil  energy  transferred  is  Tmax,  Tmax  p  =  AM  ,MpE/(M ,  +  Mp)2.  For  neutrons 
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due  to  hard  sphere  collision  of  radius  R,  dop(E,T)  =  R2  dT /7 max  which  is  in¬ 
dependent  of  T.  In  the  case  of  charged  particles,  Rutherford  scattering  predominates 
and 

dap(E,T)  =  nZfZ2pe*  M,dT/MpET2.  (1) 

In  this  case  there  is  a  strong  dependence  of  d op(E,T)  on  T  showing  that  most  of  the 
energy  loss  occurs  with  small  recoil  energies  at  approximately  right  angles  to  the 
incident  particle  direction. 

Consider  a  binary  alloy  having  elements  with  well  separated  masses  MA,  MB  (and 
charges  ZA ,  ZB).  The  strong  dependence  on  these  variables  of  dop{E,  T)  and  Tmax 
results  in  two  well-defined  pka  spectrums.  In  an  alloy  of  NiAl  for  instance,  dcrNl/dcrA1 
~  2  at  all  recoil  energies  and  7max  Ni  /7max  A,  ~  0-5  as  seen  from  figure  1  a.  An  interesting 
situation  arises  in  the  case  of  irradiation  by  energetic  electrons  05  MeV)  and  very 
low  energy  light  ions  (~  keV).  Due  to  the  large  difference  in  the  maximum  recoil 
energies  it  could  so  happen  that  the  entire  damage  process  is  confined  only  to  the  lighter 
atom  species  constituting  the  alloy.  A  number  of  alloys  like  stainless  steel  which  are 
technologically  important  contain  a  significant  amount  of  light  elements  like  C,  A1  and 
Si.  These  elements  could  be  either  dispersed  or  could  be  in  the  form  of  precipitates 
generically  called  y':  like  Ni3Al,  Ni3Si,  and  are  responsible  for  the  hardening 
characteristics  of  alloys.  The  process  of  selective  displacements  upsets  the  equilibrium 
of  the  thermally  formed  composition  resulting  in  the  restructuring  of  the  alloy. 

The  subthreshold  effects  dominate  at  low  energies  (except  for  electrons)  where  the 
scattering  cross-section  is  high  as  seen  from  figure  la.  However  at  higher  energies  the 
displacement  damage  by  the  heavier  masses  becomes  more  significant  as  can  be  seen 
from  figure  1  b.  There  is  however,  another  feature  which  becomes  important  in  alloys 
even  if  there  is  no  mass  difference  between  the  alloying  elements.  This  arises  from  the 
dependence  of  the  pka  spectrum  on  the  local  atomic  concentrations  (r)  and  t>B  (r), 


Figure  1.  (a)  Recoil  energy  spectrum  of  Ni  and  A1  atoms  in  NiAl  alloy  with  2  MeV  protons 

(b)  Comparative  damage  rates  produced  by  the  Ni  and  A1  PKA  spectrums  for  different  proton 
energies  (Panigrahi  1984). 
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(<%  +  <%  =  !)•  The  pka  spectrum  (ignoring  mixing  effects)  is  given  by 

pka AB  (r ,T)  =  (daAB/dT)  Nc4  B(r),  (2) 

where  N  is  the  total  atomic  concentration.  The  spatial  variation  in  (r)  and  <%  (r)  arises 
because  the  distribution  of  A  and  B  atoms  would  depend  on  whether  the  alloy  has 
stoichiometric  order,  short  range  order  (sro),  long  range  order  (lro)  or  is  completely 
disordered.  Thus  even  though  the  irradiation  is  completely  homogeneous,  in¬ 
homogeneities  in  the  pka  spectrum  lead  to  gradients  in  the  defect  structure  introducing 
fluxes  of  atoms  and  defects  which  can  restructure  the  alloy  (Krishan  and  Abromeit 
1984). 

2.1/?  Secondary  displacement  effects:  The  process  by  which  pkas  produce  secondary 
displacements  is  via  replacement  collision  sequences  where  each  atom  displaces  the 
neighbouring  one  along  the  crystallographic  directions.  A  vacancy  is  generally 
produced  at  the  beginning  of  the  chain  and  the  replacement  sequence  propagates  3  to  6 
lattice  distances  (sometimes  as  long  as  20)  till  the  extra  atom  is  displaced  to  an 
interstitial  position.  For  a  replacement  sequence  the  pka  must  transfer  an  energy 
between  7}  and  Td  where  7}  is  the  focussing  energy  and  Td  the  minimum  threshold 
displacement  energy  to  form  a  stable  vacancy-interstitial  atom  pair.  Td  is  typically 
10-30  eV  and  depends  on  the  crystallographic  direction.  The  higher  limit  7}  arises 
because  of  dechannelling  effects  and  for  energy  transfers  larger  than  these,  cascades  are 
formed  with  several  vacancies  (^  1000  for  typical  large  cascades)  and  the  atoms  are 
removed  to  peripheral  interstitial  sites  by  several  replacement  sequences. 

The  scenario  of  separation  of  vacancies  from  self-interstitials  via  replacement 
sequences  to  escape  spontaneous  correlated  recombination  needs  to  be  considered 
more  carefully  in  alloys.  The  complete  transfer  of  energy  from  the  moving  to  a 
stationary  mass,  when  the  masses  are  equal,  forms  the  basis  on  which  replacement 
sequences  are  formed.  In  alloys,  where  different  masses  are  involved,  there  will  be  a 
large  dissipation  of  energy  during  transfer  and  this  effectively  inhibits  the  propagation 
of  such  sequences.  Figure  2  shows  four  different  situations  which  can  arise  and  these 
will  be  discussed. 

Figure  2a  shows  an  ordered  FeAl  alloy.  In  this  alloy  we  find  that  channels  along  the 
<111)  direction  are  blocked  and  no  replacement  sequences  can  propagate;  on  the  other 
hand  the  <100)  and  <110)  channels  are  free.  The  threshold  displacement  energy 
surface  will  show  a  high  degree  of  anisotropy  in  such  alloys.  There  have  been  very  few 
detailed  studies  of  such  effects  though  some  resistivity  measurements  have  been  made 
at  low  temperatures  (Rivier  and  Dinhut  1981)  after  irradiating  with  electrons  along 
various  crystal  orientations.  Figure  2b  shows  a  plane  of  atoms  for  a  NiAl  alloy.  Such 
alloys  are  composed  of  two  interpenetrating  simple  cubic  sublattices  a  and  /?.  On  the  A- 
rich  side  the  excess  A-atoms  are  accommodated  on  the  /i-sublattice  as  antistructure 
defects,  while  on  the  B  rich  side  structural  vacancies  are  formed  on  the  a-sublattice.  The 
existence  of  these  vacancies  which  can  be  as  much  as  10%  would  not  make  it  possible 
for  any  collision  sequences  to  operate  on  the  a-sublattice  as  shown  in  the  figure  2b. 
Such  vacancies  would  play  a  very  effective  role  as  recombination  centres  and  therefore 
significant  differences  can  be  expected  in  the  formation  of  the  microstructure  as  a 
function  of  alloy  composition.  In  another  class  of  alloys  viz  CuZn  and  AuCo  the  lattice 
defects  consist  of  A-atoms  on  /5-sites,  B-atoms  on  a-sites  and  vacancies,  the  behaviour 
of  replacement  sequences  would  be  correspondingly  different.  A  more  complex 
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Figure  2.  The  blocked  channel  directions  (broken  arrows)  and  free  channels  (full  arrows) 
along  which  replacement  sequences  can  operate  producing  Frenkel  pair  damage  in  alloys 
(a)  Unequal  masses  do  not  allow  replacement  channels  to  propagate  along  some  directions  like 
<  1 1 1  )  for  FeAl  alloys  (b)  In  alloys  which  are  away  from  stoichiometric  composition  vacancies 
appear  as  structural  defects  on  one  of  the  sublattices  blocking  all  replacement  channels  in  the 
sublattice.  The  shaded  dots  are  on  a  separate  plane  and  belong  to  a  different  sublattice,  (c)  In  a 
disordered  alloy  with  unequal  masses  all  channels  would  be  blocked  but  for  the  short  range 
clustering  which  allows  replacement  sequences  (d)  In  glassy  alloys  the  short  range  order  is 
sufficient  to  allow  short  replacement  sequences  to  occur  making  them  also  vulnerable  to 
damage. 


situation  arises  in  cases  where  the  alloy  is  disordered  as  shown  in  figure  2c.  In  such 
cases  if  the  mass  difference  is  large  it  should  be  very  difficult  for  replacement  sequences 
to  operate  along  any  direction  making  them  ideal  radiation  damage  resistant  materials. 
However,  this  is  not  the  case  since  some  amount  of  short  range  ordering  will  persist. 
Since  it  is  sufficient  for  chains  to  be  a  minimum  of  three  to  four  lattice  spacings  long  to 
form  a  stable  Frenkel  defect,  a  large  number  of  chains  would  be  possible.  However, 
under  identical  conditions  resistivity  measurements  at  low  temperatures  do  indeed 
show  that  the  disordered  phase  is  more  radiation  resistant  than  the  ordered  one. 
Finally,  figure  2d  shows  the  even  more  extreme  situation  encountered  in  metallic 
glasses.  In  this  case  besides  composition  disorder,  one  encounters  spatial  disorder  in  the 
atomic  arrangements.  Here  also  there  is  evidence  (Egami  1 983)  for  chemical  sro,  and  on 
the  scale  of  replacement  sequence  lengths  (a  few  atomic  spacings)  the  spatial  disorder 
will  not  have  a  significant  influence,  particularly  when  one  views  the  structure  on  the 
basis  of  a  network  model  (Gaskell  1980;  Krishan  1982b).  However,  as  we  will  see  in  the 
next  section  the  vacancies  will  collapse  in  about  10” 10  sec  being  unstable  in  such 
amorphous  systems. 

2.2  Defects  and  their  mobility  in  concentrated  alloys 

We  have  already  encountered  two  defects  namely  ‘structural  vacancies’  and  ‘antistruc¬ 
ture’  defects  which  exist  when  we  move  away  from  the  stoichiometric  compositions  in 
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alloys  like  NiAl,  CoGa,  CuZn,  AuCd.  Interstitialcy  transport  via  the  dumb-bell  type 
configuration  has  been  identified  as  a  possible  mechanism  for  transport  though  there  is 
only  limited  experimental  evidence.  Vacancy-interstitial  complexes  also  appear  to  play 
an  important  role  in  some  alloys  like  Fe-Cr-Ni  containing  impurities.  On  the  other 
hand,  in  systems  like  metallic  glasses  even  less  is  known. 

2.2 a  Interstitialcy  mechanisms :  Our  understanding  of  defects  in  concentrated  alloys 
comes  mainly  from  the  experience  in  dilute  alloys  and  pure  metals.  Here  the  self¬ 
interstitial  is  known  to  exist  as  a  dumb-bell  along  <  100  )  direction  in  fee  lattices.  In  a 
concentrated  alloy  A,  B  this  picture  would  involve  three  possible  dumb-bells  A-A,  A-B 
and  B-B.  If  their  binding  energies  and  migration  energies  are  about  the  same  they  would 
be  mobile  and  would  freely  change  their  form  as  shown  in  figure  3.  Due  to  such 
continuous  exchanges  the  identity  of  the  interstitial  is  lost  though  atomic  mobility  can 
take  place.  Identification  of  such  a  process  experimentally  is  difficult  since  the 
behaviour  is  very  similar  to  a  self-interstitial  in  a  pure  metal  and  produces  no  additional 
structure  in  resistivity  measurements.  In  principle  such  a  complex  could  lead  to 
segregation  though  there  is  no  direct  experimental  evidence  for  this.  Such  a  mechanism 
is  believed  to  operate  in  alloys  like  Cu  Ni  (Wollenberger  1982)  possibly  in  stainless  steel 
Fe-Cr-Ni,  and  alloys  of  Cu  with  Co,  Fe,  Zn  and  Ge  (Robrock  1983). 

A  more  complex  situation  arises  if  the  binding  energies  for  the  A-A,  A-B  and  B-B 
complexes  are  relatively  different  and  so  are  their  migration  energies.  Three  possibilities 
exist  as  shown  in  figure  4a  depending  on  cot  the  trapping  and  a>d  the  detrapping  jump 
frequencies.  In  this  case  the  ability  of  the  defect  to  move  becomes  strongly  dependent 
on  the  paths  available  as  shown  in  figure  4b.  Bocquet  (1983)  showed  that  when  the 
trapping  is  infinitely  deep  a>t  >  cod,  the  ability  for  the  defect  to  percolate  depends  on  the 
solute  concentration.  If  the  solute  concentration  CB  is  lower  than  a  limiting  percolation 
value  pc  ~  0-29,  the  defect  is  trapped  in  clusters  of  B-atoms  which  are  entirely 
surrounded  by  A-atoms  and  the  long  range  migration  of  the  defect  becomes  impossible. 
Internal  friction  experiments,  done  by  Halbwachs  and  Beretz  (1980),  show  that  in  AgZn 
the  interstitial  mobility  is  smaller  than  the  mobility  of  vacancies.  Generally  strong 
trapping  effects  are  observed  when  an  oversized  “impurity”  atom  is  involved  or  an 
undersized  ‘impurity’  is  involved.  A  detailed  discussion  of  this  has  been  given  in  the 
review  by  Robrock  (1983). 

2.2 b  Minor  alloying  addition  in  Fe-Cr-Ni  alloys :  Fe-Cr-Ni  alloys  are  technologically 
important  and  elements  like  Ti,  Si,  Mo  have  been  used  to  control  their  swelling 
behaviour.  Sharma  et  al  (1978)  and  Dimitrov  et  al  (1981)  have  obtained  some 


Figure  3.  Changes  in  the  dumb-bell  configurations  as  it  migrates  (a)  B-B  dumb-bell 
changing  to  (b)  A-B  dumbell  and  (c)  A-A  dumb-bell.  Transport  by  such  interstitialcy 
mechanism  also  causes  structural  rearrangements. 
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Figure  4.  (a)  Relative  binding  and  migration  energies  which  can  lead  to  transport  by  (1)  B-B 

(2)  A- A  &  A-B  and  (3)  A-B  dumb-bells  a>d  and  cot  are  the  dissociation  and  trapping  frequencies, 
(b)  Percolation  of  a  B-B  dumb-bell  at  low  A-atom  concentrations  and  no  percolation  at  high  A- 
atom  concentrations.  Percolation  paths  depend  on  the  alloy  concentration  and  effect  the 
atomic  mobility  by  interstitialcy  mechanism. 


Figure  5.  Resistivity  change  by  minor  alloy  elements  indicating  different  trapping  processes 
(Sharma  et  al  1978). 


information  about  defect  trapping  mobilities  with  these  solutes,  using  low  temperature 
resistivity  technique.  The  main  results  are  shown  in  figure  5.  At  low  temperatures  till 
190  K  the  decrease  in  the  resistivity  is  due  to  the  recombination  of  close  pair  Frenkel 
defects.  Between  190K  and  260  K  the  transport  by  the  interstitialcy  mechanism  is 
postulated  in  pure  Fe-Cr-Ni.  However,  from  the  shift  in  the  temperature  for  Si  and  Mo, 
it  follows  that  they  strongly  trap  the  interstitials  while  Ti  does  not.  In  the  temperature 
range  260  K  to  460  K,  vacancies  also  become  mobile  and  are  strongly  trapped  by  Ti  and 
weakly  by  Si  and  Mo.  At  temperatures  above  460  K  the  release  of  interstitials  and 
vacancies  from  Si  and  Ti  atoms  is  responsible  for  the  resistivity  decrease.  These  features 
are  in  general  agreement  with  the  swelling  behaviour  in  these  alloys.  It  is  perhaps  worth 
pointing  out  that  the  contribution  to  resistivity  in  concentrated  alloys  comes  from  the 
defect  concentrations  and  structural  changes. 
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2.2 c  Defects  in  amorphous  alloys :  There  is  even  less  understanding  about  detects  in 
amorphous  alloys  more  commonly  known  as  metallic  glasses.  The  glassy  phase  always 
has  a  density  lower  than  the  crystalline  phase  indicating  that  there  should  be  a  large  free 
volume  available.  Experiments  have  failed  to  show  any  presence  of  large  atom  sized 
holes  which  could  be  called  ‘vacancies’  and  it  is  therefore  thought  that  this  free  volume 
is  distributed  amongst  the  atoms.  Irradiation  would  produce  instantaneous  ‘vacancies’ 
due  to  displacement  of  atoms  but  such  vacancies  exist  for  only  ~  1 0  “ 1 0  sec  as  shown  by 
computer  simulation  (Bennett  et  al  1979)  and  this  volume  gets  distributed  (figure  6a). 
However  very  low  temperature  resistivity  experiments  show  some  structure  and  the 
possibility  that  they  may  be  stable  in  some  complex  form  cannot  be  ruled  out.  Cahn 
(1978)  on  the  basis  of  diffusion  data  argued  that  a  duality  picture  involving  both 
distributed  volume  and  atom-sized  holes  may  be  required.  A  theoretical  justification  for 
this  has  been  given  by  Krishan  (1982b)  on  the  basis  of  a  random  network  model  where 
the  distributed  volume  could  exist  as  a  configurational  lock  which  could  be  thermally 
activated  to  form  an  atom-sized  hole  (figures  6b  and  c).  Such  a  ‘hole’  would  migrate  but 
would  not  be  detectable  since  it  would  collapse  into  a  configuration  lock  in 
—  10“ 10  sec.  The  presence  of  such  ‘holes’  is  required  to  explain  atomic  transport  which 
appears  to  have  the  same  temperature  dependence  as  in  crystalline  metals  where  the 
vacancy  mechanism  is  involved.  On  the  other  hand,  there  is  not  so  much  difficulty  in 
identifying  interstitials  since  they  would  reside  in  the  interatomic  spaces  of  the 
disordered  network  (shown  in  figures  6b  and  c).  They  cannot  form  a  part  of  the  network 
since  such  a  process  would  make  the  local  atomic  density  to  become  more  than  that  of 
the  crystalline  solid,  and  there  seems  to  be  no  evidence  for  this.  Though  the  structure 
and  properties  of  interstitials  are  not  understood,  the  diffusion  constant  for  light  gases 
like  He  seems  to  be  very  similar  to  that  in  crystalline  solids. 

There  is  no  evidence  at  all  for  any  extended  defects  and  electron  microscopy  does  not 
reveal  any  structures  which  could  be  equivalent  to  dislocations,  precipitated  phases  or 
grain  boundaries.  Theoretically  the  possibility  of  disclinations  as  stable  defects  has  been 
postulated  by  Rivier  (1979)  but  there  is  no  experimental  evidence  for  this.  However  slip 
bands  have  been  observed  in  mechanically  deformed  metallic  glasses. 

2.3  Atomic  mixing  and  restructuring  in  alloys 

Besides  displacement  of  atoms,  irradiation  also  rearranges  the  atoms  during  replace¬ 
ment  sequences  since  atoms  get  shifted  in  the  process.  This  rate  is  often  even  greater 


Figure  6.  Schematic  diagram  showing  (a)  lower  density  of  the  glassy  phase  (b)  The  same 
atoms  are  shown  on  the  basis  of  an  interpenetrating  network  model  with  a  displacement  event 
a  ‘vacant  site’  and  an  interstitial  (arrow)  in  the  internetwork  position  (c)  The  vacant  site 
collapses  in  ~  10“ 10  sec  forming  a  configuration  lock  distributing  the  vacant  volume. 
Structure  in  (c)  by  thermal  activation  can  produce  the  structure  in  (b)  showing  the  duality 
behaviour  of  the  ‘atom  sized  hole’  and  distributed  volume  (Krishan  1982b). 
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than  the  displacement  rate;  the  ratio  e  being  about  2  for  electron  irradiation  and  80  for 
fast  neutron  irradiation  in  Ni3Mn.  Also  such  rearrangements  can  be  caused  by  cascades 
since  the  atoms  during  the  collapse  of  the  cascade  will  be  rearranged.  The  extent  of 
rearrangement  or  disorder  depends  on  the  state  of  the  alloy.  In  case  there  is  only  short 
range  clustering,  then  the  changes  can  be  quantified  in  terms  of  the  Warren-Cowley 
short  range  ordering  (sro)  parameters  a,  .  These  parameters  express  the  number  of  like 
atoms  which  on  an  average  would  be  in  i  =  1,  2  .  .  .  neighbouring  shells.  On  the  other 
hand  if  the  original  state  is  ordered  with  atoms  prescribed  to  fixed  a,  ft  sublattices  in  the 
ideal  state,  then  the  degree  of  order  is  quantified  by  the  Bragg-William  lro  parameter  S 
defined  by 

S  =  (fAa-XA)/(l-XA\  (3) 

fA  is  the  probability  of  finding  an  /1-atom  on  an  a-sublattice  and  X  A  is  the  average  A  -atom 
concentration.  From  purely  statistical  considerations  the  disordering  rate  for  lro  is 


(dS/dr)irr  —  - eKS , 


where  K  is  displacement  rate.  The  changes  in  sro  and  lro  by  irradiation  are  studied  by 
diffuse  and  small  angle  x-ray  and  neutron  scattering  techniques.  Such  measurements 
reveal  the  basic  mechanisms  operating  in  the  alloy.  The  irradiation  effect  can  be  seen  in 
terms  of  the  average  distance  an  interstitial  atom  is  moved  from  its  vacancy  in  a 
replacement  sequence.  This  can  be  expressed  by  W  (r  —  r')  which  is  the  normalized 
probability  for  producing  A  (or  B )  interstitial  at  a  distance  (r  —  r').  The  spatially 
dependent  dose  rate  KA  (r),  KB  (r)  for  interstitial  production  is  given  by 


KAB(r)  =  aa 


A,B 


W(  r-r'K^(r')dV 


where  <rA  B  is  the  displacement  cross-section  for  A,  B  atoms  and  a  the  proportionality 
constant.  One  can  get  an  idea  about  the  restructuring  of  the  alloy  if  we  take  the  Fourier 
transform  W  (k)  of  W( r  —  r')  and  decompose  it  into  correlation  effects  of  different 
wavelengths.  This  gives  some  idea  about  the  state  of  lro  which  will  be  most  affected 
since  these  would  have  a  wavelength  comparable  to  the  dominant  wavelength  in  W(k) 
(Abromeit  and  Krishan  1984). 


3.  Structural  changes  in  alloys 

Alloys  under  irradiation  are  essentially  non-equilibrium  systems.  They  develop 
concentration  gradients  in  point  defects  and  complexes  formed  by  irradiation.  These 
gradients  cause  fluxes  of  atoms  and  violate  the  thermodynamic  requirement  that,  at 
equilibrium,  the  chemical  potential  of  each  species  is  constant  throughout  the  system. 
Thus  these  fluxes  produce  segregation  and  restructuring  of  the  alloy  resulting  in 
development  of  new  phases,  changes  in  the  short  range  and  long  range  order  and  new 
types  of  sinks  to  maintain  a  steady  state.  In  this  section  we  give  examples  of  these  effects. 

3.1  Process  of  segregation  and  precipitation 

Radiation-induced  segregation  has  been  extensively  studied  because  of  its  important 
technological  implications.  For  example,  austenitic  grains  of  stainless  steel  can 
transform  entirely  into  ferritic  during  neutron  irradiation  (Potter  1983),  because  of  Ni 
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segregation  towards  and  chromium  segregation  away  from  grain  boundaries.  Also 
despite  small  concentration  of  silicon  in  stainless  steels,  it  tends  to  segregate  to  grain 
boundaries  to  form  intermetallic  silicide  phases  like  /~Wi3Si  which  have  a  profound 
influence  on  the  in  service  performance  of  components.  Segregation  essentially  occurs 
via  two  processes.  The  inverse  Kirkendall  effect  where  a  gradient  in  vacancy 
concentration  near  a  sink  results  in  a  reverse  flux  of  A  and  B-atoms,  the  atom  diffusing 
faster  leaves  behind  a  region  rich  in  the  slower  atomic  species.  The  second  process 
involves  the  diffusion  via  the  defect-solute  complex.  The  complex  diffuses  in  its  own 
concentration  gradient  which  could  be  opposite  of  that  of  the  solute.  A  criteria  for  such 
a  process  is  that 

jt?  complex  _j_  £  defect  ^  J7  complex 

where  Eb  and  Em  refer  to  the  binding  and  migration  energies  of  the  defects  involved. 

Converse  processes  like  the  dissolution  of  existing  precipitates  are  also  known  to 
occur.  This  has  been  observed  in  Ti-modified  stainless  steel  containing  y'  precipitates  by 
Lee  et  al  (1980)  after  neutron  irradiation.  The  electron  micrograph  in  figure  7 
(Varatharajan  et  al  1982)  shows  another  example  of  dissolution  of  y'  precipitates 
Ni3(Al,  Ti)  after  50keV  helium  ion  irradiation  where  the  fragmentation  effects  are 
clearly  evident.  There  are  various  processes  by  which  dissolution  can  occur.  For 
example  a  phase  with  a  positive  volumetric  mismatch  with  respect  to  the  matrix  will 
tend  to  have  a  bias  for  vacancies  and  hence  will  establish  a  gradient  of  those  atomic 
species  which  migrate  by  vacancy-mechanism.  Another  mechanism  of  dissolution  has 
been  postulated  by  Nelson  et  al  (1972)  where  the  solute  atoms  are  ejected  from  the 
precipitates  by  impinging  energetic  pka’s.  The  recoil-induced  concentration  of  solutes 
may  exceed  the  equilibrium  concentration  and  thus  would  start  a  new  nucleation  for 
the  precipitate.  Thus  it  appears  that  the  original  precipitate  is  fragmented.  The  different 
stages  of  this  process  are  schematically  shown  in  figure  8. 

3.2  Short-range  clustering  and  long-range  decomposition  of  Ni-C*u  alloy 

The  Ni-Cu  alloy  has  been  extensively  studied  and  shows  complete  miscibility  in  the 
solid  state  at  temperatures  well  above  600  K  where  thermal  diffusion  is  high  for  mass 
transport.  At  around  600  K  thermally-activated  mass  transport  becomes  sluggish  and 


Figure  7.  Dissolution  and  fragmentation  of  y'  precipitates  Ni3(Al,  Ti)  produced  by  low 
energy  a  particles,  (a)  unirradiated  (b)  irradiated  in  nimonic  90  alloy  (Varatharajan  et  al  1982). 
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Figure  8.  Schematic  representation  of  fragmentation  and  reprecipitation  of  precipitates  at 

different  stages. 

has  a  tendency  to  show  short-range  clustering  (Vrijen  1977).  On  the  basis  of  this 
spinodal  decomposition  (see  the  review  by  de  Fontaine  1979)  is  predicted  but  has  never 
been  observed  because  of  low  atomic  mobility.  Wagner  et  al  (1980, 1982)  have  shown  by 
small-angle  neutron  scattering  experiments  on  62Ni-41  %  64Cu  alloy  that  in  the 
temperature  range  373-480  K  a  long-range  periodic  composition  fluctuation  can  be 
produced  by  electron  irradiation,  with  a  wavelength  of  about  45  A.  Two  points  of  view 
can  be  adopted.  The  role  of  the  irradiation  is  only  to  enhance  diffusion  but  the  kinetics 
is  determined  by  the  Cahn  (1966)  and  Cook  (1970)  theory  of  spinodal  decomposition. 
In  this  theory  the  increase  in  the  intensity  of  the  fluctuation  of  wavevector  k  is  governed 
by  the  ‘amplification  factor’  atherm  (k2).  This  is  positive  inside  the  spinodal  due  to  a 
change  in  sign  of  the  second  derivative  of  the  free  energy.  Such  a  model  however  is  not 
completely  consistent  with  the  experimental  results  (Wagner  et  al  1982). 

The  second  point  of  view  is  to  consider  the  decomposition  as  a  radiation-induced 
process.  This  possibility  has  been  examined  by  Krishan  and  Abromeit  (1984)  and 
Abromeit  and  Krishan  (1984).  In  the  presence  of  irradiation  there  is  radiation- 
enhanced  vacancy  and  interstitial  diffusion.  It  would  appear  that  the  vacancy 
mechanism  will  contribute  to  an  enhancement  of  the  thermodynamic  decomposition 
while  the  interstitial  diffusion  would  effect  the  irradiation-induced  processes.  In  the 
simplest  case  where  point  defect  annihilation  is  only  by  recombination,  the  vacancy 
concentration  would  be  the  same  order  as  that  for  interstitials.  However  due  to  the 
much  higher  diffusion  constant  of  the  latter,  the  interstitialcy  transport  mechanism  will 
dominate.  The  thermodynamic  amplification  factor  will  now  be  replaced  by  a  new 
effective  amplification  factor. 

a=ir(k2)  =  a.herm(k2)  +  airr<k2)-  (6) 

The  contribution  to  cijrr(k2)  comes  from  two  effects  which  have  already  been  discussed, 
viz,  the  spatial  dependence  of  B  which  produces  fluxes  of  A  and  B  atoms  contributing 
to  restructuring  of  the  alloy  and  the  mixing  effect  discussed  in  §3.3  which  influence 
fluctuations  of  wavevector  k  depending  on  !F(k). 

3.3  Order-disorder  transformations  in  NiAMo 

The  alloy  Ni4Mo  shows  a  wide  range  of  order-disorder  transformations  and  is 
therefore  a  good  example  of  such  processes.  It  has  been  extensively  studied  by  Banerjee 
et  al  (1984)  over  a  wide  range  of  temperatures  50  K  ^  T  ^  1050  K  in  a  high  voltage 
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electron  microscope.  During  irradiation  we  have  already  discussed  some  of  the 
replacement  processes  by  which  the  alloy  disorders  and  loses  its  short  range  and  long 
range  order.  The  process  of  irradiation  is  non-equilibrium  in  nature  and  eventually 
there  are  competing  ordering  mechanisms,  and  the  alloy  attains  some  equilibrium 
structure  depending  on  temperature.  Interestingly,  the  final  state  to  which  an  alloy 
develops  seems  to  be  very  sensitive  to  the  initial  state  of  the  alloy  as  shown  by  figure  9 
where  the  transition  from  the  long  range  ordering  stage  to  short  range  ordering  is 
forbidden.  An  ordering  mechanism  map  which  has  been  developed  in  terms  of  the  long 
range  ordering  parameters  and  the  experimentally  determined  short  range  ordering 
reflection  intensity  ratio  R  is  shown  in  figure  10,  which  gives  the  broad  features  of  the 
effects  in  this  alloy. 

A  detailed  understanding  of  all  the  processes  responsible  for  these  order-disorder 
effects  is  not  available.  However,  the  formation  of  the  different  regions  appear  to  be 
related  to  the  different  processes  by  which  atomic  mobility  can  occur.  At  temperatures 


DISORDER 


SRO 

Forbidden 


Figure  9.  Allowed  and  forbidden  transitions  in  order-disorder  transformations. 


Region  A  =  destruction  of  LRO. 

Region  B  =  no  significant  change  if  initial  state  cor¬ 
responds  to  S  =  1  or  S  =  0. 

Region  C  —  continuous  ordering  by  decay  of  the 
SRO  waves  and  simultaneous  amplifi¬ 
cation  of  the  LRO  waves. 

Region  D  =  initially  as  in  Region  C.  After  SRO  is 
destroyed  completely  the  degree  of  LRO 
increases  further  by  domain  formation 
and  growth. 

Region  E  =  nucleation  and  growth  of  Dia  domains. 

Region  F  =  destruction  of  SRO. 

Region  G  =  reduction  of  the  degree  of  SRO  and  at 
T  >  550  K  transition  to  LRO. 

Region  H  =  development  of  SRO.  Data  for  the 
build-up  of  SRO  in  the  completely  disor¬ 
dered  material  is  so  far  only  available  for 
200 K  <T  <  450K. 

Region  I  =  destruction  of  SRO  and  transition  to 
LRO. 


Figure  10.  Radiation  ordering  Map  in  Ni4Mo  by  electron  irradiation.  S  is  the  long  range 
order  parameter  and  R  a  measure  of  short  range  order  (Banerjee  et  al  1984). 


Radiation-induced  structural  changes  in  alloys 


373 


T  <  200  K  there  is  very  little  mobility  of  atoms  apart  from  that  provided  by  radiation- 
induced  diffusion.  The  next  two  critical  temperatures  from  the  alloy  ordering  map  in 
figure  10  are  T*  (sro)  ~  200  K  and  T*  (lro)  ~  450  K.  It  appears  that  above  T*  (sro) 
the  short  range  ordering  processes  are  restored  in  the  alloy  due  to  diffusion  via  an 
interstitialcy  mechanism  involving  Mo  atoms.  This  is  conjectured  from  the  fact  that  the 
Mo  atom  is  considerably  larger  than  the  Ni  atom  which  is  known  to  be  mobile  at 
somewhat  lower  temperature.  The  temperature  T*  (lro)  is  connected  with  mobility  of 
atoms  via  the  vacancy  mechanism.  Banerjee  and  Urban  (1984)  have  done  a  detailed 
calculation  based  on  this  process  where  they  have  considered  the  various  jump 
mechanisms  from  the  ffff  fi-ct  and  a-a  sub-lattices  which  are  occupied  by  the  Ni  and  Mo 
atoms.  These  jumps  lead  to  ordering  of  the  alloy  and  provide  an  explanation  of  the  high 
temperature  behaviour  in  ordering  mechanism  map. 

3.4  Irradiation  effects  in  metallic  glasses 

Metallic  glasses  are  a  rather  extreme  example  of  alloys  where  the  structure  is  both 
spatially  and  compositionally  random.  Though  a  large  number  of  alloys  have  been 
produced  in  the  glassy  phase,  metal-metal  glasses  like  Ni60Nb40  and  Ni64Zr36  are 
better  suited  for  radiation  environments  because  of  the  low  neutron  capture  cross- 
sections  and  high  crystallization  temperatures.  Also  they  are  expected  to  show  high 
radiation  damage  resistance  because  practically  no  restructuring  effects  would 
influence  these  alloys! 

These  expectations  have  to  some  extent  been  borne  out  by  experiments.  For  example 
there  has  been  no  direct  observation  of  void  formation  for  which  the  existence  of  biased 
sinks  like  ‘dislocations’  are  required.  Rechtin  et  al  (1978)  have  studied  Ni60Nb40  using 
3  MeV  Ni+  ions  and  have  reported  a  high  stability  of  the  amorphous  phase  and  no 
observation  of  voids  using  tem.  Also  Cahn  et  al  (1981)  reported  on  neutron  irradiation 
of  Ni-Zr  glasses,  they  found  Ni64Zr36  to  be  highly  stable  whereas  alloys  with  higher  Zr 
concentrations  (Ni33Zr67)  partially  crystallized.  Detailed  studies  have  been  made  of  the 
partial  atomic  structure  factors  of  metallic  glasses  after  irradiation  using  x-ray 
scattering.  Also  other  techniques  like  positron  annihilation  and  resistivity  have  been 
used  but  nothing  specific  could  be  concluded  regarding  the  damage  structure  (see  the 
review  by  Nandedkar  and  Tyagi  1984). 

The  situation  under  low  energy  irradiation  using  gaseous  ions  is  somewhat  different. 
These  irradiation  conditions  broadly  fall  in  the  domain  of  fusion  reactors.  Here  one 
finds  that  though  metallic  glasses  are  more  resistant  to  damage  effects  than  crystalline 
alloys,  their  behaviour  shows  a  similar  pattern  (Tyagi  et  al  1983a,  b,  1984).  Gases  like 
helium,  hydrogen  and  argon  have  been  found  to  precipitate  into  gas  bubbles  which 
ultimately  leads  to  blistering  and  flaking.  Figure  1  la  shows  the  precipitation  of  helium 
into  bubbles  20-50  A  in  size  at  a  very  high  density  of  ~  1019  cm-3  in  Ni60Nb40  glass. 
Figure  11b  shows  blistering  in  Fe40Ni40P14B6  and  figure  11c  shows  flaking  in 
Ni60Nb40  glasses.  Measurements  of  the  critical  dose  for  blistering  show  that  it  is 
50-100%  higher  in  the  glassy  phase  than  in  the  corresponding  crystallized  phase. 

The  reasons  for  this  increased  resistance  of  metallic  glasses  are  not  very  clear  and  why 
the  damage  structure  should  eventually  be  so  similar  to  crystalline  materials  during 
gaseous  ion  bombardment  only  is  also  rather  surprising  in  some  respects.  The  higher 
resistance  of  metallic  glasses  to  gaseous  ion  implantation  effects  appears  to  be  related  to 
the  lower  ability  of  such  glassy  materials  to  retain  insoluble  gases  like  helium  as 
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compared  to  crystalline  materials  where  they  can  be  easily  trapped  in  vacancies. 
Though  metallic  glasses  have  1-2%  more  free  volume  (equivalent  of  ~  102O-1021 
vacancies!)  than  the  crystalline  phase,  this  as  we  have  discussed,  is  distributed  over 
several  atoms  and  is  not  effective  in  trapping  helium  which  seeks  ‘atom-sized  holes’. 
However  since  gaseous  ion  implantation  is  accompanied  by  the  production  of ‘vacant 
sites’  and  ‘displaced  atoms’  the  helium  could  be  trapped  into  some  of  these  ‘vacant  sites’ 
before  they  collapse  in  ~  10~ 10  sec.  Once  of  course  helium  is  trapped  into  such  a  site  in 
a  dynamic  process,  it  would  stabilize  the  ‘vacancy’.  Such  a  kinetic  trapping  model  has 
been  examined  and  the  main  effect  of  the  vacancy  collapse  is  to  reduce  the  rate  of 
‘helium-vacancy’  complex  formation  (Krishan  et  al  1984a).  This  explains  the  higher 
critical  doses  for  blistering  which  are  observed.  Our  understanding  of  the  basic  damage 
processes  in  these  glasses  is  slowly  improving. 

4.  Phase  instability  by  irradiation 

For  a  variety  of  reasons  materials  in  radiation  environments  must  be  looked  upon  as 
non-equilibrium  systems  even  though  they  may  be  in  a  steady  state  condition.  This  is 
because  they  are  open  systems,  continuously  subjected  to  a  flux  of  energetic  particles 
across  boundaries,  with  energies  often  107  times  more  than  the  thermal  energies  of  the 
atoms  in  the  solid.  Such  a  system  clearly  cannot  be  viewed  as  a  thermodynamic  system 
whose  final  equilibrium  state  is  governed  by  minimum  entropy  or  energy  consider¬ 
ations  but  not  the  path  it  takes  to  reach  this  state.  Also  the  processes  involved  in 
irradiation  are  basically  irreversible  since  irradiation  produces  point  defects  which  are 
‘dissipated’  at  sinks  with  no  reverse  fluxes.  Thus  the  basic  principle  of  detailed  balance, 
where  microscopic  processes  must  proceed  at  the  same  rate  both  ways,  is  violated. 

4.1  Mechanism  of  instability 

Application  of  non-equilibrium  methods  to  the  behaviour  of  metals  under  irradiation  is 
of  relatively  recent  origin.  Non-equilibrium  systems  maintained  at  steady-state  can 
undergo  non-equilibrium  transitions  when  some  control  parameter  crosses  a  certain 
critical  value.  The  view  that  all  irradiation  effects  like  segregation,  precipitation  and 
ordering  are  simple  consequences  of  growth  or  shrinkage  by  point  defect  fluxes  cannot 
be  maintained,  since  the  highly  nonlinear  nature  of  the  processes  could  drive  the  system 
to  new  nonequilibrium  states. 

As  a  simple  illustrative  example  consider  the  growth  of  ‘defects’  like  precipitate  or 
voids  due  to  a  steady  flux  of ‘point  defects’  which  is  sustained  by  the  irradiation.  In  such 
a  situation,  a  balance  will  be  maintained  between  the  steady-state  concentration  of 
‘point-defects’  and  the  size  and  concentration  of  ‘defects’  as  it  evolves.  It  is  important 
that  this  balance  should  be  a  stable  one  i.e.  if  we  change  one  of  these  variables,  the  system 
must  attempt  to  come  back  to  its  earlier  steady  state.  This  can  be  quantified  in  terms  of 
the  time  or  frequency  |co,-|  =  1/t i9  it  takes  for  it  to  return  to  equilibrium,  i  being  the 
variable  which  is  disturbed,  or  the  degree  of  freedom  of  the  system.  If  this  disturbance  is 
localized  then  it  could  be  decomposed  into  a  disturbance  involving  several  wave  lengths 
or  wave  vectors  by  Fourier  expansion.  Each  of  these  wave  vectors  will  have  its  own 
response  time  or  frequency  |co,  (k2)|.  These  frequencies  of  course  depend  on  the  basic 
system  parameters  like  the  sink  strengths,  recombination  constants,  dose  rate,  mixing 
effect  etc  and  this  can  be  explicitly  shown  as  a  dependence  on  the  system  variable  s  by 
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|co,(s,  k2)|.  Near  the  steady  state  the  approach  to  equilibrium  is  exponential  in  time  t  so 
that  the  system  will  re-establish  its  equilibrium  as 

~  exp  [<uf  (s,  k2)t ]  .  (7) 

An  interesting  situation  can  arise  if  for  a  given  k  value  co,(s,/c2)  becomes  zero  for  5  =  sc , 
a  critical  value.  This  would  mean  that  the  time  t,(s,  k2)  taken  for  wavelengths 
fluctuations  k  to  return  to  their  original  steady  state  would  be  infinite.  In  short  the 
system  would  never  return  to  its  original  steady-state  but  will  bifurcate  to  a  new  non¬ 
equilibrium  state  and  a  new  structure  will  evolve.  For  a  stable  system  all  c k2)  are 
negative  implying  a  decay  to  steady  state. 

4.2  Physical  systems 

Several  examples  of  such  processes  have  been  considered  in  recent  years.  Barbu  et  al 
(1981)  discussed  a  precipitation  instability  in  undersaturated  Ni-Si  alloy.  They  found 
that  there  is  a  threshold  value  of  dose  rate  which  depends  on  temperature. 

A  similar  instability  has  also  been  reported  in  AlZn  alloy  by  Cauvin  and  Martin 
(1983).  Another  interesting  application  of  this  theory  is  the  void-lattice  formation. 
Voids  are  initially  disordered  but  after  certain  time  orders  into  a  lattice  with  the  same 
symmetry  as  that  of  the  host  matrix.  Besides  pure  metals  this  is  formed  in  a  number  of 
alloys  including  stainless  steel.  In  this  case  the  disorder  to  order  transition  occurs  at  a 
critical  value  of  the  sink  densities  (ps)c  and  ( pv\  for  voids  and  dislocations  like  vacancy 
loops  respectively. 

Taking  into  account  the  various  rate  processes  it  is  found  that  cof(sc,  k2)  =  0  for 
sc  —  [(Ps)c>  (pJc]  and  this  is  shown  in  figure  12.  The  value  of  the  wavelength  Xc  —  2n/kc 
in  this  model  depends  on  the  mean  free  path  of  point  defects  before  they  are  absorbed. 


4  (m'2) 


Figure  12.  Critical  sink  densities  ps,  pv  for  void  ordering  in  stainless  steel  (Krishan  1980, 

1982a). 
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The  long  range  ordering  under  electron  irradiation  in  NiCu  alloy  which  was  discussed 
earlier  would  involve  a  similar  process  since  the  initial  short  range  clustering  eventually 
gives  way  to  long  range  ordering  which  is  characteristic  of  such  dissipation  structures. 
In  this  case  the  mixing  effect  of  interstitials  is  identified  as  the  order  parameter 
(Abromeit  and  Krishan  1984). 


5.  Changes  in  properties  of  alloys 

Besides  radiation,  alloys  used  in  reactors  are  subjected  to  high  stresses  and  tempera¬ 
tures.  Their  stability  under  all  these  three  influences  must  be  ensured.  We  have  already 
seen  a  number  of  structural  changes  which  alloys  undergo  during  irradiation.  These 
changes  influence  the  mechanical  behaviour  and  generally  have  a  degrading  influence. 
It  is  therefore  not  sufficient  to  understand  the  changes  in  the  alloy  structure  by 
irradiation  but  also  to  correlate  these  with  its  changes  in  properties  like  creep,  swelling, 
ductility  and  fracture  behaviour.  In  table  1  some  of  these  features  have  been  tabulated. 
One  of  the  problems  in  such  studies  in  neutron  environments  is  the  difficulty  in 
monitoring  under  controlled  conditions  parameters  like  stress  and  temperature. 
Therefore,  accelerators  are  being  increasingly  used  for  such  metallurgical  investigations 
(Krishan  et  al  1984b)  and  we  will  give  a  few  illustrations  of  such  studies. 

In  §3  we  had  seen  from  resistivity  studies,  the  influence  on  defect  mobility  of 
elements  like  Si,  Mo  and  Ti  (figure  7).  Gessel  and  Rowcliffe  (1977)  discussed  the 


Table  1.  Effects  of  phase  changes  on  the  properties  of  structural  materials 


Phase  change 

Affected  properties 

Mechanism 

Formation  of  finely  dispersed 
precipitates  (e.g.,  y') 

Flow  properties  such  as  yield 
strength,  ultimate  strength  and 
work  hardening  rate 

(a)  Precipitation  hardening 

(b)  loss  of  solid  solution  hard¬ 
ening  and  (c)  changes  in  the  ki¬ 
netics  of  recovery  and  recrystal¬ 
lisation  processes. 

Swelling  and  irradiation  creep 

(a)  Dislocation  pinning 

(b)  removal  of  ‘swelling  inhibi¬ 
tors’  from  solution  (c)  preci¬ 
pitate  collector  effect. 

Formation  of  ‘massive’  phase 

Flow  properties  such  as  yield 
strength,  tensile  strength  and 
work  hardening  rate 

(a)  Second  phase  strengthening 

(b)  changes  in  the  kinetics  of  re¬ 
covery  and  recrystallisation 
processes. 

Swelling  and  irradiation  creep 

(a)  Loss  of  ‘swelling  inhibitors’ 
from  the  parent  phase. 

(a)  Formation  of  semicontinu- 
ous  or  continuous  films  along 
grain  boundaries  or  interphase 
boundaries 

Impact  toughness,  fracture 
toughness  and  ductility 

Propagation  of  cracks  along 
paths  having  low  interfacial 
energy. 

(b)  Segregation  along  grain 
boundaries  or  interphase 
interfaces 
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influence  of  these  and  other  elements  like  W,  Al,  Nb  and  C  on  Fe-7-5%  Cr-20%  Ni 
alloy.  Their  results  are  shown  in  figure  13a  and  show  a  marked  reduction  in  the 
swelling.  This  is  ascribed  to  the  trapping  of  vacancies  and  interstitials  by  the  impurity 
additions.  However,  these  effects  can  manifest  in  a  complex  way  since  segregation  and 
precipitation  could  remove  these  elements  from  the  matrix.  The  formation  of  y'- 
Ni3(Ti,  Al)  could  for  instance,  remove  Ti  and  Al  which  would  result  in  high  swelling 
rates  at  higher  doses.  This  is  demonstrated  from  the  work  of  Bajaj  et  al  (1980)  where  the 
peak  swelling  is  shown  to  increase  with  the  volume  fraction  of  y'  in  nimonic  pe16  alloy 
(figure  13b). 

The  influence  of  stress  on  swelling  has  been  a  matter  of  considerable  concern.  It  had 
so  far  been  difficult  to  establish  this  effect  and  to  make  a  systematic  study  because  of  the 
complex  loading  patterns  encountered  in  reactors.  Such  effects  have  been  recently 
studied  by  Khera  et  al  (1982)  and  Sahu  and  Jung  (1984)  using  light  ion  irradiation  from 
cyclotrons  in  the  energy  range  7-10  MeV.  The  studies  on  ‘pure  stainless  steel’  are  at  low 
doses  where  the  influence  on  the  early  stages  of  void  growth  can  be  studied.  Under 
tensile  stresses  ~  100  MPa  the  results  show  (figure  14)  a  marked  increase  in  the  void 
density.  It  is  interesting  to  conjecture  the  effect  of  a  compressive  and  a  tensile  stress  on 
void  nucleation.  As  seen  from  figure  14  the  void  size  distribution  is  significantly 
influenced  there,  being  larger  number  of  voids  with  larger  diameters.  This  is 
presumably  because  under  a  compressive  stress  void  nucleation  is  inhibited  while  a 
tensile  stress  aids  it. 

Similar  studies  on  the  irradiation  creep  behaviour  under  both  compressive  and 
tensile  stress  have  been  done  by  Sahu  and  Jung  (1984)  in  ‘pure  stainless  steel’  and  the 
results  for  compressive  stress  are  shown  in  figure  1 5.  The  strain  rate  attains  a  steady 
state  value  after  approximately  10  hrs  of  irradiation.  This  is  presumably  because  in  the 
early  stages  of  irradiation  the  alloy  restructuring  takes  place  and  sinks  like  voids 
develop.  This  region  is  the  primary  irradiation  creep  region. 

As  a  final  example  we  discuss  some  results  on  Fe-3-5  %  Ni  ferritic  steel  (A  203  grades). 
This  steel  has  been  irradiated  at  the  cirus  reactor  to  a  neutron  fluence  of  3*5 


IMPURITY  ADDITIONS  (atom*/.) 


Figure  13.  (a)  Influence  of  minor  alloying  elements  on  swelling.  The  swelling  ratio  is 
normalized  with  respect  to  the  ‘pure’  alloy  takeii  as  unity  (after  Gessel  and  Rowcliffe  1977)  (b) 
Effect  of  y'  precipitation  on  swelling  due  to  removal  of  minor  alloying  elements  (after  Bajaj  et 
al  1980). 
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Figure  14.  Comparison  of  void  size  distribution  during  compressive  and  tensile  stress  (Sahu 
and  Jung  1984)  Inset  shows  the  effect  of  dose  and  tensile  stress  on  void  swelling  (Khera  et  al 
1982).  Experiments  have  been  done  using  light  ion  beams  at  10  MeV  with  accelerators. 
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Figure  15.  Strain  due  to  irradiation  creep  produced  by  compressive  load  using  7  MeV 
protons  (Sahu  and  Jung  1984). 

x  1019ncm~2,  and  investigated  for  its  ductility  and  impact  toughness  under 
irradiation.  The  nil  ductility  transition  temperature  ndtt  should  remain  nearly  40°C 
below  the  operating  temperature  of  any  component,  ndtt  was  measured  by  impact 
energy  experiments  and  the  results  are  plotted  in  figure  16  for  the  irradiated  and 
unirradiated  sample  (Baldev  Raj  et  al  1983).  A  significant  increase  in  the  ndtt  is 
observed  from  a  value  of  —  91  °C  to  149°C.  Due  to  irradiation  the  flow  stress  of  a 
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TEST  TEMPERATURE  (  °C  ) 

Figure  16.  Shift  in  the  ndtt  temperature  by  low  dose  irradiation  in  cirus  reactor  of  Fe- 
3-5%  Ni  alloy  (Baldev  Raj  et  al  1983). 


material  is  increased  while  the  fracture  stress  is  reduced.  Field  ion  microscope  studies 
show  nucleation  of  very  small  loops  probably  arising  out  of  the  interaction  of  solute 
atoms  with  irradiation  produced  interstitials. 
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X-ray  photoelectron  spectroscopy  for  surface  film  analysis  in 
corrosion  research 

P  K  CHAUHAN,  H  S  GADIYAR  and  R  KRISHNAN 

Metallurgy  Division,  Bhabha  Atomic  Research  Centre,  Trombay,  Bombay  400085,  India 

Abstract.  Surface  films  on  metals  and  alloys  often  protect  them  from  reaction  with  the 
environment,  and  hence  a  knowledge  of  their  protective  properties  and  composition  could  be 
invaluable  for  predicting  their  corrosion  behaviour.  XPS  (x-ray  photoelectron  spectroscopy) 
could  provide  a  quantitative  analysis  of  the  chemical  composition,  the  nature  of  valence  states 
and  elemental  distribution  within  the  surface  films. 

The  present  paper  reviews  the  potential  of  this  technique  in  corrosion  studies.  A  brief  review 
of  the  work  done  on  the  passivation  of  iron  and  iron-chromium  alloys  and  on  the  inhibition 
studies  on  copper  base  alloys  has  been  given.  A  few  examples  of  investigations  carried  out  at 
authors’  laboratory  are  also  included.  An  attempt  has  been  made  to  establish  a  correlation 
between  the  compositions  of  the  films  formed  and  corrosion  behaviour  of  carbon  steel  in  10-5 
pH  lithium  hydroxide  solution  and  of  Cu-Ni  alloys  and  sacrificial  Al-Zn-Sn  alloys  in  synthetic 
sea-water. 

Keywords.  X-ray  photoelectron  spectroscopy;  surface  film;  passivation;  inhibition;  pitting; 
sacrificial  anodes. 

PACS  No.  81  60 

1.  Introduction 

Most  of  the  engineering  metals  and  alloys  possess  a  high  degree  of  corrosion  resistance 
due  to  the  presence  of  surface  films  which  are  often  adherent,  tenacious  and  resistant  to 
attack  by  the  environment.  The  surface  properties  are  very  important  as  every  type  of 
reaction,  be  it  solid-solid,  solid-liquid  or  solid  gas,  commences  at  the  surface.  For 
example  austenitic  stainless  steels  are  resistant  to  attack  due  to  the  presence  of  a 
chromium  rich  passive  layer.  Ni-Cr  alloys  (15  to  20%  Cr)  possess  high  temperature 
oxidation  resistance  due  to  the  formation  of  a  protective  Cr203  or  NiCr204  scale. 
Further,  corrosion  inhibition  studies  have  indicated  that  inhibitors  often  change  the 
surface  film  to  provide  better  protection.  A  mention  may  be  made  of  the  formation  of 
copper-benzotriazole  complex  on  copper  and  copper-nickel  alloys  in  the  presence  of 
benzotriazole  inhibitor  in  an  aqueous  medium. 

The  study  of  the  surface  composition  and  property  correlations  provides  basic 
guidelines  to  understand  the  various  types  of  corrosion  failures  like  localised  attack, 
intergranular  corrosion,  stress  corrosion  cracking  etc.  Though  other  conventional 
methods  of  corrosion  evaluation  like  weight  change,  electrochemical  measurements  etc 
have  been  in  use,  x-ray  photoelectron  spectroscopy  (xps)  has  emerged  as  one  of  the  most 
powerful  tool  for  the  study  of  composition  of  2-3  nm  thick  corrosion  films.  The 
technique  was  first  developed  by  Siegbahn  (1967)  who  had  earlier  shown  the  possibility 
of  distinguishing  from  the  photoelectron  spectrum,  an  element  in  different  valence 
state.  Since  then,  xps  has  been  utilised  in  a  variety  of  research  problems  in  a  number  of 
laboratories.  In  xps,  all  the  energy  of  the  incident  photon  is  expended  in  the  ejection  of 
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the  photoelectron.  Thus,  the  binding  energy  (Eg)  of  the  electron  with  respect  to  Fermi 
level  can  be  determined  by  the  relationship: 

hv  =  Fphoto  +  EB  +  Q,  (1) 

where  £photo  is  the  kinetic  energy  of  the  photoelectron  and  Q  is  the  work  function  of  the 
spectrometer  material.  The  energy  of  the  photon,  hv,  is  a  known  quantity.  EB  assumes 
different  values  for  different  electronic  shells  and  it  is  to  some  extent,  dependent  on  the 
chemical  state  of  the  atom.  Consequently,  the  energies  of  the  ejected  photo-electrons 
may  be  shifted  slightly  as  the  chemical  state  is  changed.  Shifts  in  core  binding  energies 
can  be  measured  and  related  to  the  chemical  environment.  Studies  of  the  satellite 
structure  in  the  spectra,  measurement  of  relative  intensities  as  well  as  energies  of 
photoelectron  peaks  and  determination  of  the  angular  distribution  of  the  expelled 
photo-electrons  could  provide  useful  information  about  the  chemical  species,  xps  has 
thus  attracted  the  attention  of  the  corrosion  scientists,  as  the  technique  is  capable  of 
analysing  few  monolayer  thick  films,  which  cannot  be  analysed  by  other  techniques. 
Thus,  the  action  of  inhibitors  and  the  phenomenon  of  passivation  can  now  be 
understood  in  a  much  better  way. 

As  already  mentioned  xps  spectrum  of  an  oxide  can  be  distinguished  from  that  of  the 
parent  metal.  In  many  cases,  it  is  possible  by  observing  the  chemical  shifts  (Castle  and 
Epler  1974;  Hoare  and  Talerman  1972;  McIntyre  and  Zetaruk  1977;  Asami  and 
Hashimoto  1977;  Allen  et  al  1974),  peak  shapes  (McIntyre  and  Zetaruk  1977),  multiplet 
splitting  (Carver  1972)  and  presence  or  absence  of  satellites  (Castle  1971).  Hydroxides 
(McIntyre  et  al  1976)  and  chlorides  (Belton  and  Clarke  1976;  Kishi  and  Ikeda  1974)  can 
also  be  identified.  Carbon  in  carbidic  form  can  also  be  distinguished  from  that  due  to 
hydrocarbon  contamination  (Ramqvist  1970).  One  difficulty  that  frequently  arises  in 
xps  characterization  of  corrosion  products  is  that  the  oxide  film  formed  in  aqueous 
solution  on  certain  metals  and  alloys  may  be  susceptible  to  further  atmospheric 
oxidation  (before  introducing  the  specimen  into  the  vacuum  chamber  of  the 
spectrometer)  and  one  may  end  up  with  erroneous  conclusions.  In  addition,  valence 
state  determination  of  the  oxide  is  meaningless  after  Ar+  ion  bombardment  unless  the 
oxide  has  been  checked  for  stability. 

2.  Brief  review  of  corrosion  research  using  xps 

2.1  Passivation  studies  on  iron  and  its  alloys 

Several  studies  have  been  reported  on  the  passivation  of  iron  and  iron-chromium 
alloys.  Roberts  and  Wood  (1977)  have  investigated  the  mechanism  of  oxidation  and 
passivation  of  iron  by  water  vapour  using  xps.  The  authors  have  studied  the  interaction 
of  water  vapour  with  iron  at  290  K  by  xps  and  have  found  clear  evidence  for  the 
passivation  of  surface.  The  O  Is  peak  observed  at  530  eV  has  been  assigned  to  the 
oxygen  chemisorption  or  a  surface  oxide  while  that  at  532  eV  to  the  OH  (adsorbed) 
species.  The  peak  at  534  eV  has  been  related  with  the  molecularly  adsorbed  water.  After 
having  established  the  presence  of  both  OH*5  -  and  O'5-  species,  the  authors  have  suggested 
that  the  first  step  is  the  dissociation  of  water  followed  by  interaction  between  the 
surface  hydroxyls  to  give  chemisorbed  oxygen: 

H20  -+  OH  (ads)  +  H  (ads); 

20H  (ads)  ->  2  0(ads)  +  H2  (g). 
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A  significant  feature  of  the  study  was  that  the  ratio  of  the  intensities  of  the  530  and 
532  eV  peaks  remained  similar  upto  the  maximum  water  exposures.  It  was  therefore 
suggested  that  a  discrete  chemical  phase  of  FeOOH  existed  on  the  surface  as  a  result  of 
passivation. 

xps  studies  on  the  electrochemical  passivation  of  chromium  (Bouyssoux  and  Romand 
1977)  in  0-5  M  H2S04  at  +  0-55  V/NHE  for  90  minutes  indicated  two  spin  doublets 
Cr  2 pl/2  and  Cr  2 p3/2.  The  higher  binding  energy  of  2 p3/2  peak  (575*6  eV)  has  been 
assigned  to  the  oxide  while  lower  binding  energy  (573*1  eV)  to  the  metal.  These  peaks 
have  been  compared  with  those  of  standard  oxides  of  chromium  and  it  has  been 
concluded  on  the  basis  of  O  Is  peak  at  5294  eV  that  the  formation  of  anhydrous  phase 
of  Cr203  takes  place  as  a  result  of  passivation. 

Asami  et  al  (1978)  have  reported  the  composition  of  passive  film  formed  on  an 
extremely  corrosion-resistant  amorphous  alloy  of  iron  (Fe-10  at.  %  Cr-13  at.  %  P-7 
at.  %  C),  in  1  N  HC1  as  a  result  of  passivation  by  immersing  for  168  hours  at  30°C.  The 
study  indicated  that  the  air-formed  film  on  the  alloy  mainly  consisted  of  ferric  oxy- 
hydroxide.  However,  on  immersing  in  HC1,  chromium  peak  at  577  eV  dominated  and 
the  traces  of  iron  in  the  film  existed  both  in  Fe2+  and  Fe3+  states.  The  chromium  peak 
at  577  eV  has  been  attributed  to  oxidised  chromium.  The  low  binding  energy  peak  of  O 
Is  spectrum  (metal-O  bond)  shifted  after  immersion  from  530*2  eV  to  530*6  eV.  The 
peak  at  530*2  eV  has  been  assigned  to  Fe-O  bonds  in  the  air-formed  film  while  that  at 
530*6  eV  to  Cr-O  bonds  in  the  passive  film.  Relatively  high  intensity  in  the  high  energy 
peak  of  O  Is  spectrum  of  the  air-formed  film,  which  has  been  assigned  to  large  amount 
of  OH  groups,  decreased  in  intensity  by  immersing  into  HC1.  The  authors  have 
assigned  this  observation  to  the  formation  of  a  homogeneous,  hydrated  chromium 
hydroxide  CrOx  (0H)rnH20  and  thus  causing  passivation  of  the  surface.  Asami  et  al 
(1978)  have  further  investigated  the  passivity  of  a  series  of  iron-chromium  alloys  in 
sulphuric  acid  reported  that  chromium  content  of  the  passive  film  increased  drastically 
at  ca  13  at.  %  Cr  of  the  bulk  alloy  composition  whereas  no  composition  change 
occurred  in  the  surface  of  the  substrate  alloy  immediately  under  the  passive  film.  The 
high  corrosion  resistance  of  iron-chromium  alloys  with  high  chromium  content  has 
been  attributed  to  the  protective  nature  of  hydrated  chromium  oxy-hydroxide  which  is 
the  main  constituent  of  passive  films  in  alloys  containing  chromium  at  least  1 2*5  at.  %. 

Chance  and  Gaarenstroom  (1980)  have  characterised  the  surface  films  formed  on  a 
ferritic  stainless  steel  (sae  type  J  409)  in  sodium  sulphate  solutions  of  differing  pH  as  a 
function  of  polarization  potential.  The  study  showed  that  the  critical  current  for 
passivation  was  extremely  sensitive  to  pH,  e.g.  with  0*5  M  sulphate  solutions  at  82  C,  the 
critical  current  for  passivation  increased  by  more  than  two  orders  of  magnitude  as  the 
pH  changed  from  3*0  to  2*8.  xps  investigations  indicated  that  composition  and  thickness 
of  a  surface  film  changes  significantly  during  anodic  polarization  but  the  most 
noteworthy  was  the  change  from  the  films  of  iron  and  oxygen  in  the  prepassive  region 
into  the  chromium  and  oxygen  films  in  the  passive  region.  The  films  which  were  initially 
made  of  Cr  and  O  did  not  change  above  primary  passivating  potential.  The  authors 
have  reported  thickening  of  films  in  the  prepassive  region  during  anodic  polarization 
and  then  thinning  of  the  films  on  attaining  passivity.  It  has  been  observed  that  the  film 
thickness  on  alloy  surfaces  at  open  circuit  potential  is  inversely  related  to  the  corrosivity 
of  the  solution  i.e.  film  thickness  decreases  as  the  corrosivity  of  the  solution  decreases. 
Film  thicknesses  in  the  above  investigation  have  been  determined  by  aes  depth  profiling 
technique,  the  discussion  of  which  is  outside  the  scope  of  this  paper. 
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2.1  a  Studies  conducted  in  barc  Correlation  between  the  corrosion  behaviour  and 
composition  of  the  oxide  film  formed  on  carbon  steel  in  105  pH  lithium  hydroxide 
solution  has  been  established.  Carbon  steel,  when  exposed  to  deaerated  105  pH  lithium 
hydroxide  solution  at  room  temperature  showed  negligible  corrosion  rate.  The  film 
formed  after  24  hours  was  examined  by  xps.  The  studies  in  this  section  and  elsewhere 
reported  in  the  paper  were  carried  out  using  phi  model  551  electron  spectrometer  with 
MgKa  source.  The  instrument  was  calibrated  with  respect  to  Au  4/7 /2  peak  at  83*8  eV.  C 
Is  peak  at  285  0  eV  due  to  hydrocarbon  contamination  was  used  to  account  for  the 
charging  effects.  High  resolution  spectra  were  recorded  using  pdp- 11/04  computer 
interfaced  with  the  system.  Chemical  states  of  the  metal  cations  in  the  surface  oxide 
films  were  determined  by  comparing  positions,  shapes  and  shifts  of  the  peaks  of 
different  elements  with  those  of  the  standard  spectra.  In  Fe  2 p  spectrum  (figure  1),  Fe 
2p3/2  and  Fe  2 p1/2  peak  maxima  occurred  at  710*8  eV  and  725-1  eV  (after  correction  for 
surface  charging).  Fe  2 p3/2  peak  for  Fe203  has  been  reported  to  shift  towards  higher 
binding  energy  by  4-2  eV  (Gimzowski  et  al  1977;  Yin  et  al  1972)  with  respect  to  pure 
metal  (706-8  eV)  while  a  shift  of  3  0  eV  has  been  estimated  for  Fe304  (Ertl  and  Wandelt 
1975;  Coad  and  Cunningham  1974).  Kishi  and  Ikeda  (1973)  during  oxidation  studies  of 
iron  observed  a  maximum  shift  of  4-0  eV. 

Allen  et  al  (1974)  have  reported  Fe  2 p3 /2  peaks  for  a-Fe203  and  a-FeOOH  to  appear 
at  71 1-4  and  71 1-0  eV  respectively  while  Asami  et  al  (1977)  report  the  Fe  2 p3/2  peak  for 
a-Fe203  at  7 1 0*97  eV,  for  a-FeOOH  at  711-44  eV  and  for  y-FeOOH  at  71 1-60  eV. 
McIntyre  and  Zetaruk  (1977)  report  the  values  of  71 10  ±015  eV  for  a-and  y-Fe203 
and  711-9  +  0-2  eV  for  a-FeOOH.  On  the  basis  of  these  reports,  it  can  be  concluded  that 


Figure  1.  Fe  2p  spectrum  for  carbon  steel  exposed  to  deaerated  10-5  pH  lithium  hydroxide 
solution. 


xps  in  corrosion  research 


387 


a  shift  of  4  0  eV  observed  by  us  of  the  Fe  3 p3/2  peak  with  respect  to  pure  metal 
indicates  iron  to  be  in  Fe  (III)  state. 

Figure  2  shows  the  O  Is  spectrum  which  appears  to  consist  of  two  peaks  of 
comparable  intensities.  The  peak  at  530  0  eV  arises  as  a  result  of  metal-oxygen  bond 
(hereafter  will  be  referred  to  as  OM  peak)  and  that  at  53 1  *6  eV  arises  as  a  result  of  metal- 
hydroxide  bonds  (referred  to  as  OH  peak  later  in  the  text).  Asami  et  al  (1977)  have 
reported  OM  and  OH  peaks  for  a-and  y-FeOOH  at  530-leV  and  at  531-4  eV 
respectively  while  McIntyre  et  al  (1977)  report  OM  peak  for  a-FeOOH  at  530-3 
±0-2eV  and  OH  peak  at  531-4  +  0-2eV.  On  the  basis  of  these  reports,  it  can  be 
concluded  that  the  oxide  film  formed  in  this  investigation  consists  of  FeOOH,  which 
protects  carbon  steel  from  corrosion  in  deaerated  10-5  pH  lithium  hydroxide  solution. 
However,  it  is  difficult  to  say  whether  FeOOH  formed  on  the  surface  is  in  a  or  y  form. 

2.2  Inhibition  studies 

Inhibition  studies  using  xps  have  mainly  been  reported  on  copper,  iron  and  nickel. 
Lewis  and  Fox  (1978)  estimated  the  thickness  of  inhibitive  films  formed  on  copper 
surfaces  immersed  in  solutions  of  benzotriazole  (bta)  by  three  methods  based  on  xps 
and  compared  with  the  results  obtained  using  a  fourth  method  viz  electrical  capacitance 
of  the  films.  The  methods  based  on  xps  involve  (a)  measurements  of  the  rate  of  removal 
of  the  film  by  argon  ion  sputtering,  (b)  observation  of  the  attenuation  of  the  substrate 
Cu  2 p  photoelecton  signal  on  passing  through  the  protective  film  and  (c)  a  calculation 
of  film  thickness  from  measurements  of  the  intensity  of  N  Is  photoelectron  signal  from 
the  surface  film. 


Figure  2.  O  Is  spectrum  for  carbon  steel  exposed  to  deaerated  10-5  pH  lithium  hydroxide 
solution. 
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In  procedure  (a),  the  authors  have  used  the  expression 

J.  S.  t/p  =  I.  S.  t/A.  e.  p,  (2) 

for  the  removal  rate  of  surface  molecules  by  argon  ion  sputtering  where  J  is  the  ion  flux, 
/  is  the  current,  S  is  the  sputtering  yield  (i.e.  the  number  of  surface  molecules  removed 
per  incident  argon  ion),  A  is  the  total  area  bombarded,  e  is  the  electronic  charge,  p  is  the 
density  of  molecules  in  surface  and  t  is  the  thickness  of  a  monolayer.  The  accuracy  of 
this  method  depends  upon  the  value  of  S.  In  the  absence  of  measured  values  for  this 
quantity,  the  direct  comparison  of  heats  of  sublimation  of  two  materials  for  at  least  one 
of  which  S  is  known,  would  give  a  satisfactory  estimate  for  the  other.  Method  (b) 
employed  earlier  by  Carlson  and  McGuire  (1972)  is  useful  when  the  film  and  substrate 
contain  a  common  element,  as  is  the  case  for  all  oxide  films  formed  in  corrosion 
processes.  Photoelectron  intensity  from  the  substrate  is  assumed  to  decrease  exponen¬ 
tially  with  thickness  of  overlying  film,  then 

Is  =  /J  exp  (  —  d/2  sin  0),  (3) 

for  electrons  emerging  at  an  angle  6  from  the  surface.  7® is  the  intensity  without  the  film, 
d  is  the  film  thickness  and  X  is  the  attenuation  length  (or  mean  escape  depth)  for 
photoelectrons  of  a  particular  energy.  In  this  method,  the  greatest  uncertainties  lie  in 
assuming  that  the  dynamics  of  photoemission  and  emergence  of  the  electrons  produced 
are  similar  in  all  materials  used  in  making  the  measurements.  In  method  (c)  the  authors 
have  made  use  of  the  fact  that  nitrogen  was  such  an  element  which  occurred  only  in  the 
film.  The  absolute  intensity  of  photoelectrons  emitted  from  N  atoms  has  therefore  been 
correlated  with  the  film  thickness.  The  authors  claim  to  have  achieved  close  agreement 
between  the  values  obtained  by  these  three  methods  for  determining  the  film  thickness. 

Surface  films  formed  by  two  structurally  related  corrosion  inhibitors  viz  2-mercapto 
benzothiazole  (mbt)  and  2-mercapto  benzimidazole  (mbia),  on  copper  studied  by  xps 
and  x-ray  induced  Auger  spectroscopy  have  been  reported  by  Chadwick  and  Hashemi 
(1979).  In  this  work  the  films  were  formed  in  0-0*5  NaCl  solutions  of  pH  2-7  on  two 
types  of  copper  samples — one  mechanically  polished  with  a  thin  Cu20  film  and  the 
other  with  thick  Cu20  film  grown  electrochemically  in  koh.  The  presence  of  mbt  was 
established  by  S  2 p  and  N  Is  peaks  in  the  xps  spectra.  The  measured  Cu  2 p  binding 
energies  and  the  lack  of  Cu  2 p  shake-up  satellites  in  the  spectra  represented  copper  to  be 
either  in  Cu(O)  or  Cu(I)  state.  The  presence  of  mbt  on  copper  surface  did  not  prove  that 
there  was  compound  formation  involving  copper,  and  this  evidence  was  obtained  by 
features  of  CuL3M4  5M4  5  Auger  peak.  According  to  the  mechanism  proposed  for  film 
formation,  Cu20  first  dissolves  in  acid  conditions  followed  by  precipitation  of  Cu- 
inhibitor  complex  on  to  the  metal  surface,  the  thickness  of  the  inhibitor  film  being  low 
on  the  copper  sample  with  thin  Cu20  film.  It  has  therefore  been  suggested  that  Cu  ions 
originating  from  the  Cu20  dissolution  are  involved  in  the  precipitation  process.  The 
authors  have  supported  this  suggestion  by  further  experimental  evidence.  Other 
important  studies  on  inhibitor  films  which  may  be  of  interest  to  the  readers  have  been 
reported  by  Wood  and  Vannerberg  (1978)  and  Kishi  et  al  (1979). 

2.2a  Studies  conducted  at  barc  on  Cu- 10  Ni  alloy  The  corrosion  rate  of  pure  copper  in 
synthetic  sea  water  was  1 2*4  mdd  while  that  of  Cu-lONi  alloy  was  found  to  be  3*2  mdd. 
Ni  alloy  when  exposed  to  synthetic  sea  water  polluted  with  40  ppm  sodium  sulphide 
corroded  at  a  rate  of  5T  mdd.  All  these  rates  were  evaluated  on  the  basis  of  15  days 
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exposure,  xps  investigation  on  these  samples  were  carried  out  to  find  out  the  reason  for 
the  decrease  in  corrosion  rate  of  copper  on  alloying  with  nickel  and  for  the  increase  in 
the  corrosion  rate  of  this  alloy  in  polluted  sea  water  as  compared  to  that  in  unpolluted 
sea  water. 

The  film  formed  on  pure  copper  in  synthetic  sea  water  was  found  to  be  Cu20  in 
accordance  with  the  observation  of  other  workers  (North  and  Pryor  1970;  Popplewell 
et  al  1973;  Kato  et  al  1980).  xps  survey  scan  for  Cu-lONi  alloy  exposed  to  unpolluted 
synthetic  sea  water  (figure  3)  indicated  that  main  elements  in  the  film  were  Cu,  O,  C  and 
traces  of  Cl.  High  resolution  Cu  2 p  spectrum  (figure  4)  showed  no  satellites,  with  2 p3/2 
peak  appearing  at  932*7  eV  and  2 p1/2  peak  at  952*6  eV.  lmm  Auger  peak  maximum  for 
copper  (figure  5)  occurred  at  a  kinetic  energy  of  916*6  eV.  O  Is  peak  was  asymmetric 
with  a  maximum  at  530*7  eV  (figure  6)  and  with  two  higher  binding  energy  shoulders 
e.g.  at  ca  533  eV  and  ca  535  eV.  No  appreciable  signal  from  nickel  was  obtained  from 
the  outermost  film. 

It  is  now  established  that  the  shake-up  satellite  structure  is  closely  related  to  the 
oxidation  state  of  copper  (Larson  1974;  Evans  1975;  Rosencwaig  and  Wertheim  1973; 
Frost  et  al  1972;  McIntyre  and  Cook  1975).  All  cupric  compounds  whose  Cu  2 p  spectra 
have  been  studied  show  satellite  peaks  believed  to  arise  due  to  multi-electron  excitation 
process  whereas  Cu  2 p  spectra  of  cuprous  compounds  do  not  show  satellites.  Satellites 
observed  by  Novakov  (1971),  Novakov  and  Prins  (1971)  and  Schoen  (1973)  in  cuprous 
compounds  could  be  due  to  surface  contamination  by  cupric  species  as  the  cuprous 
compounds  are  susceptible  to  atmospheric  oxidation  into  cupric  (Roberts  1974). 

Wallbank  et  al  (1973)  suggested  with  evidence  that  satellites  originate  in  2 p  spectrum 
from  simultaneous  transition  to  3 d  states  and  if  3 d  states  are  fully  occupied  (as  in 
cuprous  species),  transition  to  them  are  not  possible.  On  the  basis  of  the  above 


Figure  3.  XPS  survey  for  Cu-lONi  alloy  exposed  to  synthetic  sea  water  for  15  days. 
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Figure  6.  O  Is  spectrum  for  Cu-10  Ni  alloy  exposed  to  synthetic  sea  water;  synthetic  sea 
water  after  sputtering;  and  synthetic  sea  water  +  40  ppm  Na2S  for  15  days. 


discussion  it  can  be  concluded  that  copper  in  this  film  is  not  in  cupric  state,  lmm  Auger 
peak  at  a  kinetic  energy  of  916*6  eV  which  is  quite  different  from  that  for  pure  copper 
(at  918-8  eV)  confirms  the  presence  of  cuprous  state  (McIntyre  et  al  1976).  O  Is  peak  at 
530-7  eV  can  be  attributed  to  the  oxide  oxygen  in  Cu20  (McIntyre  et  al  1976;  Evans 
1975).  Higher  binding  energy  shoulders  are  due  to  hydroxide  and  molecular  water 
which  disappear  on  sputtering.  The  difference  between  the  binding  energies  of  Cu  2 p3/2 
and  O  Is  peaks  is  402-0 eV  which  agrees  well  with  the  values  reported  for  Cu20 
(Evans  1975).  Hence,  outermost  corrosion  film  in  this  case  can  be  concluded  to  consist 
of  Cu20  with  traces  of  chlorine  in  the  film. 

There  was  no  significant  change  in  copper  spectrum  on  sputtering,  but  additional 
nickel  signal  was  obtained.  Ni  2 p3/2  peak  showed  two  maxima  e.g.  at  854-5  eV  and  at 
856-4  eV.  There  was  a  shoulder  at  ca  852-8  eV  and  shake-up  satellite  structure  at  ca 
861  eV  and  at  ca  864  eV  (figure  7).  xps  investigations  of  nickel-oxygen  system  reveal  a 
shift  of  1-6  eV  for  Ni  2 p3/2  peak  in  NiO  with  respect  to  that  for  pure  metal  (McIntyre  et 
al  1976;  Kim  et  al  1974;  Brundle  1975;  Allen  et  al  1979).  The  peak  at  854-5  eV  and  a 
satellite  at  ca  861  eV  agree  well  with  the  results  of  Kim  et  al  (1974)  for  NiO  but  the  peak 
at  856-4  eV  is  too  intense  to  be  confused  as  multiplet  splitting  of  the  peak  at  854-5  eV 
(McIntyre  and  Cook  1 975;  Kim  and  Winograd  1 974;  Kim  et  al  1 974).  It  appears  that  the 
peak  at  856-4  eV  and  the  satellite  at  ca.  864  eV  occur  as  a  result  of  higher  oxidation  state 
of  nickel.  O  Is  peak  after  sputtering  became  symmetric  and  the  maximum  occurred  at 
530-4  eV  which  is  not  unexpected  because  the  major  portion  of  the  film  still  consisted  of 
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Binding  energy  (eV) 

Figure  7.  Ni  2p  spectrum  for  Cu-10  Ni  alloy  exposed  to  synthetic  sea  water  for  15  days  after 
sputtering. 


Cu20.  Haber  et  al  (1976)  have  reported  the  binding  energy  values  of  O  Is  peak 
associated  with  Ni3+  and  Ni2+  as  529-3  eV  and  529-8  eV  respectively;  however,  other 
authors  (Kim  and  Winograd  1974;  Brundle  1975;  Norton  et  al  1977)  report  the  value 
associated  with  Ni3  +  as  ca  531-5  eV.  It  can  therefore  be  concluded  that  the  inner  film 
also  consisted  mainly  of  Cu20  with  nickel  ions  in  two  different  valence  states  viz  Ni2  + 
and  Ni3+  either  as  a  dopant  in  Cu20  film  or  simultaneously  existing  as  a  mixture  of 
NiO  and  Ni203,  however,  the  O  Is  spectrum  does  not  provide  evidence  to  the  presence 
of  separate  nickel  oxide  because  of  domination  of  Cu20  in  the  corrosion  film. 

The  film  formed  on  copper  in  synthetic  sea  water  was  Cu20  which  is  a  p-type 
semiconductor  while  that  formed  on  Cu-lONi  alloy  in  the  same  medium  was  Cu20 
doped  with  Ni2+  and  Ni3  +  .  Doping  by  nickel  ions  leads  to  an  increase  in  ionic  and 
electronic  resistivity  and  consequently  the  corrosion  rate  of  the  alloy  decreases  in 
comparison  to  that  of  pure  copper  (Popplewell  1978). 

The  film  formed  on  Cu-lONi  alloy  in  polluted  sea  water  was  uniformly  black  and 
survey  scan  indicated  the  presence  of  Cu,  O,  S  and  C.  Cu  2 p3/2  peak  appeared  at 
932-4  eV  (figure  4)  while  lmm  Auger  peak  occurred  at  a  kinetic  energy  of  916-9  eV 
(figure  5)  indicating  copper  to  be  present  in  cuprous  state.  O  Is  showed  a  peak  with  two 
maxima  viz  532-0  eV  and  532-6  eV  and  three  shoulders  at  ca  530-7  eV  ca  533  eV  and  ca 
535  eV  (figure  6).  The  peak  at  535  eV  represented  molecular  oxygen  while  that  at 
532-6  eV  and  shoulder  at  ca  533  eV  represented  oxygen  in  the  water  molecules  bound  to 
the  surface  film.  The  shoulder  at  ca  530 J  eV  occurred  due  to  Cu20  film  on  the  surface. 
S  2 p  peak  appeared  at  162-0  eV  (figure  8)  with  a  higher  binding  energy  shoulder  at  ca. 
163  eV  and  shake-up  satellite  structure  at  ca  167-5  eV  and  ca  169*6  eV.  This  peak  can  be 
attributed  to  sulphide  species  (Lindberg  et  al  1970;  Binder  1973).  Thus,  the  corrosion 
product  in  this  case  was  a  mixture  of  Cu20  and  Cu2S  with  adsorbed  water  molecules 
and  oxygen.  Cu2S  is  known  to  be  a  more  defective  structure  than  Cu20,  and 
Cu20/Cu2S  corrosion  product  film  is  much  more  defective  than  Cu20  alone,  thus 
resulting  in  an  increased  ionic  and  electronic  transport  through  the  film  (Bates  and 
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Figure  8.  S  2p  spectrum  for  Cu-10  Ni  alloy  exposed  to  synthetic  sea  water  +  40  ppm  Na2S 
for  15  days. 


Popplewell  1975).  The  resultant  crystals  will  have  a  mismatch  at  interfaces  that  would 
result  in  structural  defects  large  enough  to  permit  the  mass  transport  of  moisture  with 
dissolved  molecular  and  ionic  species.  Such  a  porous  structure  is  responsible  for  the 
increase  in  corrosion  rate  in  polluted  sea  water. 

2.3  Pitting  studies 

Sugimoto  and  Sawada  (1977)  have  studied  the  effect  of  molybdenum  additions  to 
stainless  steels  in  inhibiting  pitting  in  acid  chloride  solutions.  Solution  annealed 
austenitic  20Cr-25Ni  steels  containing  0-5  wt%  Mo  were  polarised  in  deaerated  IN 
HC1  (pH  0  02)  in  steps  of  25  mV  from  the  corrosion  potential  and  xps  investigations 
were  carried  out  at  different  stages  of  polarization.  Anodic  polarization  of  pure  Fe,  Cr, 
Ni  and  Mo  showed  that  Fe  and  Ni  did  not  passivate  and  MO  exhibited  transpassive 
anodic  dissolution  above  —  0  03  V  and  consisted  a  film  of  Mo03  above  —  0  03  V. 
Anodic  polarization  of  Fe-Cr  alloys  containing  1 5-70  wt  %  Cr  indicated  no  pitting  in 
the  alloys  containing  >  40  wt  %  Cr.  On  alloying  with  5  wt  %  Mo  the  amount  of  Cr 
needed  to  inhibit  pitting  was  reduced  to  half  of  those  without  Mo.  xps  investigations  on 
the  passive  film  formed  on  20  Cr-25  Ni-5  Mo  steels  showed  the  film  to  be  an 
oxyhydroxide  which  contained  Fe3  +  ,  Cr3  +  ,  Ni2+  and  Mo6+  as  cations.  The  authors 
have  concluded  that  Cr3+  and  Mo6+  play  a  very  important  role  in  improving  the 
corrosion  resistance  of  the  films.  The  formation  of  very  compact  surface  films  of 
CrOOH  containing  cations  like  Mo6+,  Fe3+  and  Ni2  +  has  been  concluded  to  be  the 
reason  for  increase  in  pitting  resistance  of  Cr-Ni  steels  on  addition  of  Mo. 
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2.4  Studies  on  sacrificial  anodes  for  cathodic  protection 

Studies  on  the  xps  investigations  on  aluminium  sacrificial  anodes  have  been  carried  out 
in  authors’  laboratory  and  have  been  reported  in  detail  elsewhere  (Pai  et  al  1982).  Pure 
aluminium  is  not  recommended  for  sacrificial  anodes  because  of  passive  oxide  film  on 
its  surface.  Judicious  additions  of  Zn,  Mg,  Cd,  Hg,  Sn  and  In  as  alloying  elements  can 
possibly  modify  this  oxide  film  on  aluminium  and  render  Al  anodes  suitable  for 
cathodic  protection  of  steel  structures  in  sea  water. 

In  this  investigation,  studies  were  carried  out  on  Al-5Zn-X  Sn  alloys  where  X  varied 
from  067  %  to  05  %.  Current  efficiencies  (%)  of  alloys  with  different  heat  treatments 
were  measured  after  coupling  to  mild  steel  in  sea  water  for  180  hr.  Alloy  AF5Zn-0T  Sn 
(heated  for  3  hours  at  300°C  and  water  quenched)  was  found  to  behave  best  as  a 
sacrificial  anode  while  the  same  alloy  after  heating  for  3  hr  at  300°C  and  furnace 
cooling,  was  not  found  to  be  so  good.  On  exposing  these  specimens  to  sea  water,  xps 
studies  indicated  that  the  film  formed  on  the  quenched  alloy  contained,  in  addition  to 
A1203,  relatively  greater  amount  of  ZnO  than  that  formed  on  the  furnace  cooled  alloy. 
In  the  oxide  layer  where  ZnO  and  A1203.3H20  are  formed,  Al3  +  (ionic  radius  0*51  A) 
ions  act  as  dopant  in  ZnO  film  (ionic  radius  of  Zn2+  0.93  A)  and  increase  the  cation 
vacancy  concentration.  The  higher  oxygen  potential  gradient  developed  due  to  reduced 
aluminium  activity  favours  the  higher  rate  of  dissolution  of  Al  alloy  in  sea  water  by 
providing  driving  force  for  the  movement  of  cations  from  alloy-oxide  interface. 
Obviously,  the  quenched  alloy  in  which  the  amount  of  ZnO  formed  is  more,  will  behave 
as  a  better  sacrificial  anode. 


3.  Conclusions 

From  the  above  illustrations,  it  can  be  seen  that  xps  is  a  very  powerful  technique  for 
understanding  and  solving  the  corrosion  problems.  It  has  been  clearly  demonstrated 
that  xps  can  be  used  for  determining  the  nature,  thickness  and  composition  of  passive 
films,  for  inhibition  and  pitting  studies  and  for  understanding  the  phenomenon 
occurring  in  the  cathodic  protection.  The  authors’  studies  indicate  that  formation  of 
FeOOH  provides  protection  to  carbon  steel  against  corrosion  in  deaerated  alkaline 
medium,  better  corrosion  resistance  of  Cu-Ni  alloys  in  sea  water  is  due  to  the  formation 
of  Cu20  film  doped  with  Ni2  +  and  Ni3  +  ,  the  enhancement  of  corrosion  of  these  alloys 
in  sulphide  polluted  sea  water  occurs  due  to  the  formation  of  porous  corrosion  product 
Cu20/Cu2S  and  sacrificial  aluminium-zinc-tin  alloy  anodes  behave  better  when  the 
oxide  film  is  rich  in  zinc. 
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Abstract.  This  paper  gives  a  brief  review  of  the  important  phenomena  observed  in  metallic 
uranium  and  ceramic  nuclear  fuels  during  irradiation  in  reactors.  The  mechanism  of 
irradiation  growth,  irradiation  creep  and  swelling  which  are  responsible  for  the  dimensional 
instability  of  uranium  has  been  described.  Important  phenomena  observed  in  ceramic  nuclear 
fuels,  e.g.  fuel  densification,  fuel  restructuring,  plutonium  segregation,  oxygen  and  fission 
product  migration,  irradiation  creep,  fission  gas  release  and  swelling  have  been  discussed.  A 
brief  note  is  included  on  computer  modelling  for  prediction  of  fuel  element  irradiation 
behaviour. 

Keywords.  Irradiation  effects;  nuclear  fuels;  metallic  fuels;  ceramic  fuels;  irradiation 
behaviour;  irradiation  growth;  irradiation  creep;  swelling;  densification;  restructuring;  fission 
gas  release;  fuel  modelling. 

PACS  No.  28-40;  28  90 


1.  Introduction 

Nuclear  fuels  are  exposed  to  a  unique  and  very  severe  thermal  and  radiation 
environment  during  their  use  in  reactors.  The  behaviour  of  the  fuel  material  in  this 
environment  is  quite  different  from  its  behaviour  outside  the  reactor.  It  is  governed  by 
the  complex  interplay  of  a  large  number  of  interactive  physical,  chemical,  mechanical 
and  metallurgical  processes  which  become  operative  in  the  fuel  in  the  reactor 
environment.  A  great  deal  of  studies  have  been  made  and  are  still  continuing  to 
understand  the  irradiation  behaviour  of  fuels  in  nuclear  reactors.  These  studies  have 
revealed  spectacular  changes  in  the  geometry,  dimensions,  composition  and  micro¬ 
structure  of  the  fuels  during  and  after  fission  due  to  irradiation.  In  this  paper  some  of 
the  important  phenomena  observed  in  two  types  of  nuclear  fuels  viz  metallic  uranium 
fuels  and  ceramic  fuels  (oxide,  mixed  oxide  and  mixed  carbide),  which  are  relevant  to 
India’s  nuclear  programme,  are  described.  As  oxides  are  now  the  most  widely  used  fuels 
in  the  present-day  nuclear  power  reactors,  they  have  been  discussed  at  length.  The 
paper  briefly  reviews  the  important  findings  about  these  phenomena  and  their 
mechanisms. 


2.  Irradiation  damage  in  solid  nuclear  fuels 

The  radiation  environment  in  the  nuclear  reactor  core  consists  of  fission  fragments, 
high  energy  neutrons,  charged  particles  and  other  radiations.  The  main  source  of 
radiation  damage  in  nuclear  fuels  is  the  highly  ionised  fission  fragments  carrying  a  total 
of  165  MeV  energy  and  the  fast  neutrons  which  have  an  energy  upto  10  MeV.  Fission 
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fragments  and  neutrons  move  rapidly  through  the  lattice,  exchanging  their  energy  with 
the  lattice  atoms  until  they  come  to  rest.  The  damage  to  the  fuel  by  fission  fragments  is 
caused  in  the  following  three  ways: 

(i)  Fission  spikes :  Each  fission  fragment  is  highly  ionised  (about  20  -f)  and  has  a  range 
of  6-1 0  microns  in  the  fuel  matrix.  Over  this  small  distance,  the  fission  fragment  releases 
all  its  energy  which  has  an  effect  of  raising  the  fuel  temperature  to  more  than  3000°C  for 
a  brief  period  of  10“ 11  seconds.  This  zone  is  called  a  fission  spike.  It  is  a  cylindrical 
region  of  about  100  A  in  radius  with  the  fission  fragment  track  as  its  axis.  The  end 
portion  of  the  fission  spike  where  the  fission  fragment  comes  to  rest  is  characterised  by 
severe  disarrangement  of  lattice  atoms  and  is  often  called  displacement  spike  while  the 
rest  is  called  thermal  spike  (figure  1).  Once  the  fission  fragments  come  to  rest,  they  are 
more  usually  described  as  fission  products. 

(ii)  Lattice  defects :  Collision  of  fission  fragments  with  lattice  atoms  of  the  fuel  matrix 
produces  primary  knock-on  atoms*which  have  sufficient  energy  to  displace  other  atoms 
from  the  fuel  lattice.  This  gives  rise  to  creation  of  several  types  of  point  defects  in  the 
lattice  (figure  2).  The  formation  of  lattice  defects  in  the  fuel  matrix  provides  the  impetus 
for  atomic  diffusion  of  various  species.  All  diffusion-controlled  processes  are  enhanced 


Figure  1.  Schematic  representation  of  the  fission  spike  in  nuclear  fuel.  The  fission  fragment 
enters  the  lattice  at  left  as  shown  by  the  arrow  mark  (Kopelman  1959). 


EXCHANGE  DYNAMIC 

CLOSE  FRENKEL  PAIR  COLLISIONS  CROWDION 


Figure  2.  Schematic  picture  illustrating  the  formation  of  different  types  of  lattice  defects  by 
a  primary  knock  on  atom.  The  knock-on  atom  enters  the  lattice  at  left  and  comes  to  rest  at  P 
(Chadderton  and  Torrens  1969). 
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and  new  transport  processes  are  induced  at  quite  low  temperatures.  The  two  basic  types 
of  lattice  defects,  namely,  the  vacancy  and  the  interstitial  recombine  if  the  temperature 
is  high.  But  they  remain  as  single  defects  at  lower  temperature  and  have  a  large  effect  on 
the  fuel  element  behaviour. 

(iii)  Impurity  atoms :  Each  fission  event  creates  two  atoms  of  fission  products  which  are 
chemically  different  from  the  original  material.  Presence  and  build-up  of  these  impurity 
atoms  influence  properties  of  the  fuels  and  a  detailed  study  of  the  fission  products  is 
therefore  very  important. 

3.  Irradiation  effects  in  metallic  fuel 

Uranium  metal  exists  in  three  allotropic  modifications  with  different  crystallographic 
structure:  Orthorhombic  alpha  uranium  upto  668 °C,  tetragonal  beta  uranium 
(668-775  C)  and  body-centred  cubic  gamma  uranium  (775-1 130°C).  Most  metallic 
uranium  fuels  are  used  in  the  alpha  state.  Orthorhombic  alpha  uranium  shows  highly 
anisotropic  thermal  expansion  and  exhibits  poor  dimensional  stability  during  irradi¬ 
ation.  The  dimensional  changes  in  alpha  uranium  result  from  three  types  of  irradiation- 
induced  phenomena:  irradiation  growth,  irradiation  creep  and  fission  product  swelling. 

3.1  Irradiation  growth 

Irradiation  growth  refers  to  the  change  in  uranium  shape  at  constant  volume  without 
any  external  stress  application.  The  phenomena  of  growth  were  reported  by 
Konobeevsky  et  al  (1955),  Paine  et  al  (1955)  and  Pugh  (1955). 

During  irradiation  a  single  crystal  of  alpha  uranium  exhibits  growth  in  <010) 
direction,  shrinkage  in  <  100  )  direction  and  remains  unchanged  in  <001  )  direction.  In 
a  polycrystalline  sample  the  extent  of  dimensional  changes  depend  upon  the  degree  of 
preferred  orientation  in  the  specimen.  The  growth  of  uranium  is*  expressed  by  the 
following  correlation  (Robertson  1969): 

L  =  L0  exp  (G/), 

where  L  is  the  length  of  specimen  after  a  fraction / of  the  atoms  have  fissioned,  L0  is  the 
initial  length  of  the  specimen  and  G  is  the  growth  constant.  The  value  of  G  denotes  the 
susceptibility  of  uranium  to  growth  and  depends  upon  uranium  texture  and  irradiation 
temperature.  Growth  constant  G  has  very  large  values  at  low  temperature  (sub-zero 
temperature)  but  relatively  independent  of  temperature  in  the  range  0-300°C  and  small 
values  at  high  temperatures  (Buckley  1964).  No  growth  is  observed  at  temperatures 
above  500°C. 

The  phenomena  of  irradiation  growth  is  connected  with  the  anisotropic  thermal 
expansion  of  the  alpha-uranium  (figure  3).  The  direction  of  growth  in  uranium 
coincides  with  the  direction  which  has  a  negative  coefficient  of  thermal  expansion. 
Similarly  the  direction  which  shrinks  during  irradiation  corresponds  to  the  direction  of 
highest  thermal  expansion  coefficient.  The  mechanism  of  irradiation  growth  involves 
formation  of  extra  monolayer  plates  of  lattice  defects  created  by  the  passage  of  fission 
fragments  in  solid  uranium  on  certain  crystal  planes  (figure  4).  The  thermal  stresses  due 
to  anisotropic  expansion  of  uranium  in  the  fission  spike,  force  the  vacancies  and 
interstitials  to  aggregate  on  different  crystal  planes  to  cause  elongation  (due  to 
interstitial  clusters)  in  the  direction  <010  )  and  contraction  (due  to  vacancy  cluster)  in 
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Figure  3.  Lattice  parameters  of  alpha  uranium  as  a  function  of  temperature  (Holden  1958). 


Figure  4.  Projection  of  the  alpha  uranium  lattice  on  (001)  plane  showing  deformation 
caused  by  collapsed  aggregate  of  vacancies  on  a  (1 10)  plane  AB  and  a  disc  of  interstitial  atoms 
on  a  (010)  plane  CD,  (Ostberg  1965). 


<100)  direction.  At  higher  temperature  (above  500CC)  the  point  defects  created  in 
uranium  are  sufficiently  mobile  to  get  annhilated  and  hence  no  growth  is  observed. 

Since  growth  is  enhanced  by  preferred  orientation  the  only  way  to  control  growth  is 
to  have  a  random  texture.  This  is  obtained  by  beta-quenching  of  uranium  which  assures 
fine  randomly  oriented  grains  in  the  fuel. 

3.2  Irradiation  creep 

Irradiation  creep  and  irradiation  growth  play  an  important  role  in  the  often  observed 
bowing  of  the  uranium  fuel  element  during  irradiation  (figure  5). 
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Figure  5.  Bowing  observed  in  an  experimental  uranium  fuel  element  of  Dhruva  reactor 
(irradiated  in  cirus). 
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Accelerated  creep  of  uranium  during  irradiation  was  reported  by  Konobeevsky  et  al 
(1955).  It  was  found  by  Roberts  and  Cottrell  (1956)  that  in  a  thermal  neutron  flux  of 
1012  n/cm2/sec  at  temperature  of  100°C,  a  stress  as  low  as  1  %  of  yield  strength  caused 
appreciable  creep  in  polycrystalline  uranium.  This  observation  was  explained  on  the 
basis  of  yielding  creep  mechanism  (figure  6).  In  this  mechanism  creep  is  caused  by 
build-up  of  high  internal  stresses  between  the  grains  due  to  growth  of  misaligned  grain 
in  a  polycrystalline  material.  When  the  internal  stress  reaches  the  yield  stress,  the 
material  deforms  plastically  under  the  influence  of  an  externally  applied  stress  which  is 
small  compared  to  the  yield  stress.  Most  of  the  work  is  done  by  internal  stress,  the 
external  stress  merely  guides  the  deformation.  The  creep  rate  due  to  this  mechanism  is 
given  by  the  following  correlation  (Gilbert  1971) 

4  =  (<r/<ry)eg, 

where  ec  is  irradiation  creep  rate,  oy  is  the  yield  stress,  eg  is  the  growth  strain  rate  and  er  is 
the  applied  stress. 

The  dependence  of  creep  rate  on  neutron  flux  and  temperature  is  contained  in  the 
growth  strain  rate. 

Another  mechanism  of  irradiation  creep  in  uranium  deals  with  relaxation  of  elastic 
stress  in  the  crystal  lattice  volume  temporarily  disorganised  by  a  fission  spike  (figure  7). 
The  creep  rate  due  to  this  mechanism  is  given  by  the  following  correlation  (Dienst 
1977). 


where  o  is  the  stress,  E  is  Young’s  modulus  and  F  is  the  fission  rate. 
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Figure  6.  Yielding  creep  mechanism.  Irradiation  growth  of  misaligned  adjacent  grains 
produces  internal  stress  (Gilbert  1971). 
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Figure  7.  Relaxation  of  elastic  stress  in  a  fission  spike.  Arrows  indicate  stress,  broken  lines 
indicate  initial  specimen  shape,  solid  lines  indicate  final  specimen  shape,  vsp  is  volume  of  spike 
(Dienst  1977). 


3.3  Swelling 

Swelling  due  to  replacement  of  uranium  atom  by  fission  products  was  calculated  by 
Howe  and  Weber  (1957)  who  showed  that  the  percentage  volume  increase  in  uranium 
should  be  about  three  times  the  atom  percent  burn-up.  This  estimate  compares  well 
with  the  swelling  at  lower  temperatures  where  gaseous  swelling  is  absent.  However, 
large  amount  of  swelling  is  observed  in  high  purity  uranium  in  the  temperature  range 
35Q-600°C.  A  breakaway  swelling  (swelling  maxima)  is  observed  at  about  450°C  and 
the  magnitude  of  swelling  is  more  in  high  purity  uranium  compared  to  the  swelling  in 
uranium  containing  small  amount  of  impurities  (figure  8).  Swelling  in  the  temperature 
range  350-500°C  is  characterised  by  formation  of  large  (50-200  microns)  cavities  and 
tearing  at  grain  boundaries  and  at  twins  (Pugh  1955).  In  temperature  range  500-600°C, 
swelling  is  associated  with  formation  of  small  platelike  pores  crystallographically 
aligned  in  planar  arrays  within  uranium  grains  (Legget  et  al  1964).  The  burn-up  at 
which  enhanced  swelling  appears  in  uranium  depends  on  the  temperature  of 
irradiation.  At  higher  temperature  of  irradiation,  the  swelling  initiates  at  low  burn-up 
but  when  the  temperature  is  low  the  breakaway  swelling  can  be  delayed  upto  a  high 
burn-up.  External  restraint  is  found  to  suppress  the  formation  of  cavities  and  aligned 
pores,  reducing  fuel  swelling.  Minor  addition  of  alloying  elements  Fe,  Al,  C,  Si  etc  in 
uranium  inhibits  the  swelling. 

Metal  fuels  operate  at  lower  fractions  of  their  melting  points  where  fission  gas 
diffusion  is  fairly  slow.  Moreover,  the  thermal  gradient  across  the  fuel  is  small  due  to 
higher  thermal  conductivity  and  the  driving  force  for  directed  bubble  migration  is 
small.  Bubbles  generally  remain  quite  small  and  gas  release  is  negligible.  The  swelling  of 
uranium  originates  from  the  nucleation  and  growth  of  bubbles  of  fission  gases  xenon 
and  krypton  which  are  insoluble  in  fuel  matrix  and  tend  to  precipitate  as  bubbles  on 
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Figure  8.  Swelling  during  irradiation  as  a  function  of  temperature,  A — high  purity  uranium. 
B— normal  purity  uranium.  C — adjusted  uranium  (Robertson  1969). 


grain  boundaries,  dislocation  and  twins.  However,  the  anisotropic  growth  of  uranium  is 
considered  to  be  the  main  driving  force  (in  temperature  range  350-500°C)  for  the 
formation  of  cavities  and  tears  which  are  responsible  for  large  swelling  in  uranium.  The 
cavities  are  nucleated  at  the  existing  fission  gas  bubbles.  Alternatively  the  cavities  can  be 
nucleated  mechanically  by  the  intergranular  stresses  arising  from  anisotropic  growth  of 
adjacent  grains.  When  the  gas  bubbles  or  grain  boundary  tears  exceed  a  critical  size 
(2000  A),  they  rapidly  grow  to  form  large  cavities.  The  growth  occurs  by  migration  of 
vacancies  to  the  cavity  under  the  influence  of  tensile  stress  caused  by  intergranular 
interaction  resulting  from  anisotropic  growth  of  uranium.  In  the  temperature  range 
500~600°C  where  irradiation  growth  is  negligible,  the  swelling  is  believed  to  be  caused 
by  homogeneous  nucleation  of  voids  by  condensation  of  vacancies  (McDonell  1973). 
The  suppression  of  swelling  by  addition  of  alloying  elements  is  attributed  to  interaction 
of  point  defects  with  alloy  constituents. 

At  one  time  it  was  thought  that  the  swelling  of  metal  fuels  was  sensitive  to  the  fuel 
strength  and  therefore  could  be  reduced  by  making  stronger  fuels  with  addition  of  alloy 
elements  like  molybdenum.  It  was,  however,  recognised  later  that  the  main  restraining 
force  on  gas  bubbles  was  surface  tension  rather  than  the  matrix  strength.  Hence, 
keeping  the  gas  bubbles  smaller  e.g.  by  providing  numerous  nuclei  on  which  they  can 
form  rather  than  strengthening  the  matrix,  could  be  more  effective. 


4.  Irradiation  effects  in  ceramic  fuels 

In  addition  to  the  intense  radiation  field,  the  ceramic  fuels  are  exposed  to  very  severe 
thermal  conditions.  Volumetric  fission  heat  generation  in  ceramic  fuels  gives  rise  to 
large  temperature  difference  between  the  centre  line  and  external  surface  of  the  fuel 
elements.  A  typical  mixed  oxide  fuel  operates  with  a  high  centre  temperature 
approaching  melting  point  and  there  exists  a  radial  thermal  gradient  as  high  as 
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Figure  9.  Changes  observed  in  ceramic  fuels  during  irradiation  due  to  severe  thermal  and 
radiation  environment  in  the  fuel  pellets  (Roy  and  Sah  1983). 


3000-5000°C/cm.  Besides  causing  cracking  and  distortion  of  the  fuel  pellets  as  a  result 
of  thermal  stress,  these  unique  irradiation  conditions  provide  the  driving  force  for 
several  transport  processes  in  fuel  which  lead  to  significant  changes  in  the  microstruc¬ 
ture,  composition,  dimension  and  shape  of  the  fuel.  The  main  irradiation  effects 
(figure  9)  can  be  divided  in  the  following  broad  categories:  (a)  densification; 
(b)  restructuring;  (c)  redistribution  of  fuel  and  fission  products;  (d)  effects  of  fission 
gases:  swelling  and  gas  release;  (e)  irradiation  creep 

4.1  Irradiation-induced  densification 

The  problem  of  irradiation-induced  densification  which  results  in  formation  of  large 
axial  gaps  in  the  fuel  column  and  leads  to  collapse  of  cladding  tube  in  thes6  gaps,  was 
first  observed  in  1972  in  large  pressurised  water  reactors  (pwrs).  This  sent  a  shudder 
through  the  reactor  engineers  who  had  to  derate  the  reactor  power  pending 
investigations  into  the  mechanism  and  possible  solution  of  fuel  densification  under 
irradiation.  Fortunately,  within  a  short  period  the  mechanism  was  correctly  identified 
which  defined  the  requisite  microstructure  (consisting  of  large  grains  and  controlled 
porosity)  of  a  densification  resistant  fuel.  Use  of  densification  resistant  fuel  along  with 
internal  prepressurisation  of  the  fuel  element  provided  a  satisfactory  solution  to  the 
problem. 

The  densification  of  ceramic  fuels  during  irradiation  is  caused  by  fission  spike-pore 
interaction  which  results  in  shrinkage  and  dissolution  of  pores  present  in  the  fuel 
matrix  (figure  10).  It  takes  place  quite  early  in  life  and  has  much  greater  technological 
significance.  Irradiation-induced  pore  shrinkage  and  annihilation  was  observed  by 
Wapham  (1966)  using  electron  microscope.  Later  Ross  (1969),  Bellamy  and  Rich  (1969) 
and  Turnbull  and  Cornell  (1970)  reported  similar  observations.  A  fission  fragment 
traversing  through  fuel  grain  disrupts  the  pores  on  its  track.  The  pore  is  transformed 
into  lattice  vacancies  in  this  process.  The  vacancies  generated  in  this  event  migrate  to  the 
nearby  pores,  or  to  the  grain  boundaries.  Vacancy  migration  to  the  grain  boundary 
results  in  densification  of  the  fuel.  Due  to  large  number  of  vacancies  generated  by  one 
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AS  FABRICATED  PORE  INSIDE  THE 
FUEL  GRAIN 
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DISSOLUTION  OF  THE  PORE  INTO 
VACANCIES  BY  FISSION  SPIKE  EFFECT. 


MIGRATION  OF  THE  VACANCIES 
TO  GRAIN  BOUNDRY 


DENSIFIC ATION  OF  THE  GRAIN 
DUE  TO  REMOVAL  OF  THE  PORE 
FROM  THE  FUEL  GRAIN  BODY. 


Figure  10.  Mechanism  of  in-pile  fuel  densification  (Chubb  et  al  1975). 


fission  event  and  large  fission  rate  (1013  fissions/c.c),  the  rate  of  densification  is 
considerable. 

Assman  and  Stehle  (1978)  derived  an  equation  for  volume  change  in  fuel  due  to 
densification.  They  considered  two  populations  of  pores  in  the  fuel  body,  coarse  pores 
which  are  relatively  stable  and  fine  pores  ( <  01  /xm)  which  are  atomised  in  a  single 
encounter  with  fission  spike.  Assuming  densification  to  occur  by  flux  of  vacancies  from 
grain  body  to  the  grain  boundaries  they  gave  a  correlation  of  the  following  form  for  the 
volume  change  (AV/V0)  in  the  fuel: 

where 


and 

In  the  above  correlation  Qs  is  volume  of  fission  spike,  Ft  is  burn-up  in  fission/c.c,  n*  is 
the  fraction  of  vacancies  per  encounter  that  can  escape  from  the  fine  pores.  The  factor  /* 
(<  1)  describes  the  effect  of  coarse  pores  which  capture  a  fraction  of  vacancies 
produced.  rc  is  radius  of  coarse  pores,  rg  is  grain  size  (radius),  Zc  and  Zp  are  densities  of 
coarse  and  fine  pores  respectively,  P0  is  initial  porosity.  For  very  small  initial 
concentration  of  fine  pores  in  the  fuel  body /*  =  1  and  n*  =  n.  The  above  correlation 
shows  an  exponential  law  for  pore  removal  from  fuel  and  it  also  brings  out  the 
important  parameters  which  have  significant  effect  on  in-pile  fuel  densification. 


AF/F0=  -/*P0[l-exp(-n*QsFt)], 

,,  i 

1  +  (4n/\5)Zcrcrj  ’ 

„*  =  (1  -(4*/45  )Zprpr2g)n. 
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4.2  Fuel  restructuring 

The  microstructure  of  the  unirradiated  sintered  ceramic  fuel  pellet  consists  of  the 
equiaxed  grains  of  5-15  micron  diameter  with  uniformly  distributed  porosity.  The 
cross-section  of  irradiated  fuel  however,  reveals  concentric  zones  of  altered  microstruc¬ 
tures  (figures  11  and  12)  which  are  indicative  of  the  temperature  in  the  fuel  during 
irradiation.  These  zones  have  fairly-defined  temperatures  associated  with  their 
boundaries.  The  different  microstructural  zones  and  the  temperature  range  in  which 
they  are  formed  in  mixed  oxide  and  mixed  carbide  fuels  are  listed  in  table  1.  Equiaxed 
grain  growth  law  is  given  by  the  following  correlation  (Olander  1976): 


Dm  =  D™  +  K0t  Qxp(  —  Q/RT) 


where  D  and  D0  are  the  final  and  the  initial  grain  size,  K0  is  a  constant,  t  is  time,  Q  is 
activation  energy  and  T  is  temperature.  The  exponent  m  and  the  activation  energy  were 
found  to  be  2-5  and  460  kJ/mol  respectively  for  U02  in  laboratory  experiments.  Under 
irradiation  conditions,  the  presence  of  fission  gas  bubbles  on  grain  boundary  has  a 
retarding  effect  on  grain  growth  rate.  Ainscough  et  al  (1973)  suggested  the  following 
grain  growth  law  for  U02  during  irradiation: 


d(D)/dt  = 


1  +  2000  QF  A 

~Dm  ) 
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FUEL 
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Figure  11.  The  cross-section  of  the  irradiated  U02  fuel  element  of  Tarapur  Atomic  Power 
Station  (taps)  revealing  formation  of  a  porous  central  region  in  the  fuel  pellet,  (magnification 
6X).  The  maximum  fuel  centre  temperature  was  estimated  to  be  1250  C  (Bahl  et  al  1979). 
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Figure  12.  Restructuring  observed  in  fuel  pellet  of  Tarapur  Atomic  Power  Station 
(magnification  5X).  The  maximum  fuel  centre  temperature  was  estimated  to  be  2200°C. 


Table  1.  Microstructural  zones  observed  in  the  cross-section  of  an 
irradiated  ceramic  nuclear  fuel  pellet. 


Fuel  type 

Temperature 

range 

Microstructural 

observations 

(U,  Pu)02 

below  1100°C 

As  sintered  microstructure 

1 100-1 300°G 

Intergranular  porosity 

1 300-1 700°C 

Equiaxed  grain  growth 

Above  1700°C 

Columnar  grain  growth  and  central 
void 

(U,  Pu)C 

below  1100°C 

As  fabricated  microstructure  and 
very  small  bubbles 

1 100-1400°C 

Grain  growth,  large  gas  bubbles, 
bubbles  at  grain  boundaries 

1400-1 700°'C 

Radially  elongated  grains  and  pores 

Above  1700°C 

Irregular,  big  rounded  pores  and 
central  void 

where  Dm  is  limiting  grain  size,  k  is  the  grain  growth  constant,  F  is  fission  density  and  Q 
is  atomic  volume.  Formation  of  radially  oriented  columnar  grains  involves  movements 
of  pores  up  the  temperature  gradient  by  an  evaporation  and  condensation  mechanism. 
F  uel  on  the  hotter  side  of  a  large  lenticular  pore  evaporates  and  condenses  on  the  cooler 
side  creating  a  transport  process  for  the  pores  to  migrate  up  the  thermal  gradient.  The 
movement  of  the  pores  to  the  centre  of  the  fuel  results  in  high  density  columnar  grains 
and  the  formation  of  a  central  hole  or  ‘void’.  It  can  take  as  little  as  24  hours  at  power  for 
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restructuring  to  occur.  The  rate  of  columnar  grain  growth  is  governed  by  temperature 
and  thermal  gradient.  The  process  is  important  at  temperatures  where  the  vapour 
pressure  of  the  ceramic  fuel  becomes  significant. 

4.3  Redistribution  of  fuel  constituents  and  fission  products 

4.3a  Plutonium  redistribution :  Plutonium-bearing  ceramic  fuels  initially  contain 
uniformly  distributed  plutonium.  During  irradiation,  however,  plutonium  tends  to 
segregate  to  some  preferred  locations.  In  mixed  oxide  (U,  Pu)02  fuel  the  segregation  of 
plutonium  is  observed  to  be  related  to  the  initial  stoichiometry  of  the  fuel.  The  radial 
profile  of  plutonium  concentration  measured  in  an  irradiated  mixed  fuel  of  two 
different  initial  stoichiometry,  showed  that  in  fuel  having  an  initial  O/M  of  2  01  the 
plutonium  segregated  towards  fuel  centre  while  in  fuel  having  an  initial  O/M  of  1-935 
the  plutonium  segregation  occurred  away  from  the  fuel  centre  (figure  13).  This 
indicated  the  possibility  of  the  existence  of  a  critical  value  of  stoichiometry  for  which  no 
plutonium  segregation  should  occur.  However,  plutonium  will  segregate  towards 
periphery  or  towards  centre  depending  upon  the  fact  whether  the  initial  O/M  of  fuel 
was  lower  or  higher  than  the  critical  value  respectively. 

The  observed  plutonium  segregation  is  successfully  explained  by  vapour  transport 
mechanism  which  involves  preferential  evaporation  and  migration  of  one  of  the  fuel 
species.  The  basis  for  vapour  transport  mechanism  is  migration  of  lenticular  pores 
towards  fuel  centre  by  evaporation  and  condensation  of  fuel  material.  The  plutonium 
segregation  is  caused  by  the  difference  in  the  U/Pu  ratio  in  the  solid  and  its  vapour 
inside  the  migrating  pore.  When  the  O/M  ratio  of  the  fuel  is  such  that  congruent 
evaporation  can  take  place  then  no  plutonium  segregation  will  occur.  If  the 
stoichiometry  is  higher  (than  the  stoichiometry  for  congruent  evaporation)  the  vapour 


Figure  13.  Radial  plutonium  profile  observed  after  irradiation  in  mixed  oxide  fuels  of  two 
different  O/M  ratios  (Bramman  and  Powell  1975). 
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phase  in  the  pore  consists  almost  entirely  of  U03  with  minor  amount  of  U02  and 
Pu02.  This  leads  to  preferential  migration  of  uranium  from  high  temperature  central 
region  making  the  centre  rich  in  plutonium.  When  the  O/M  of  the  fuel  is  lower,  the 
vapour  phase  inside  the  pore  consists  of  PuO  which  leads  to  depletion  of  plutonium 
from  the  centre.  Attempts  have  also  been  made  to  explain  the  redistribution  of 
plutonium  on  the  basis  of  solid  state  thermal  diffusion  of  plutonium  (Beiswenger  et  al 
1967;  Bober  et  al  1969).  Though  it  was  possible  to  predict  plutonium  segregation 
towards  centre,  the  mechanism  of  thermal  diffusion  cannot  explain  the  reversal  in  the 
direction  of  plutonium  segregation  observed  at  low  O/M  ratio. 

4.3 b  Oxygen  redistribution :  In  a  mixed  oxide  fuel  containing  appreciable  amount  of 
plutonium,  oxygen  also  gets  redistributed  under  the  influence  of  radial  thermal 
gradient.  The  oxygen  migration  in  the  fuel  is  governed  by  the  initial  stoichiometry  of 
the  fuel.  In  a  hypostoichiometric  fuel,  the  oxygen  migrates  towards  the  low  temperature 
side  ( i.e .  towards  fuel  periphery)  while  in  a  hyperstoichiometric  fuel  the  oxygen 
migration  occurs  from  low  temperature  to  high  temperature  side  (figure  14).  In  both 
cases,  the  fuel  surface  tends  to  approach  stoichiometry. 

The  redistribution  of  oxygen  observed  in  the  mixed  oxide  fuel  is  explained  by  two 
mechanisms  (Olander  1976):  (i)  oxygen  transport  in  gas  phase  by  counter  diffusion  of 
C02  and  CO,  (ii)  thermal  diffusion  of  oxygen  vacancies.  The  mechanism  of  oxygen 
transport  through  gas  phase  is  based  on  the  attainment  of  a  thermodynamic 
equilibrium  among  C02,  CO  and  oxygen  in  the  existing  temperature  gradient.  The 
gases  CO  and  C02  are  formed  from  the  carbon  impurity  in  the  fuel.  In  a 
hyperstoichiometric  fuel  the  C02  diffusing  to  the  hotter  zone  releases  an  oxygen  atom 
to  the  solid  and  the  CO  formed  diffuses  back  to  the  cooler  zone  where  it  picks  up  an 
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Figure  14.  Redistribution  of  oxygen  in  mixed  oxide  fuel  pellet  of  different  initial  stoic¬ 
hiometry  (Assman  and  Stehle  1982). 
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oxygen  atom  to  form  C02  which  repeats  the  cycle.  This  happens  till  a  constant 
C02/C0  ratio  determined  by  the  initial  composition  of  the  fuel  is  achieved. 

The  above  model  meets  with  difficulty  when  applied  to  transport  in  the  hypo- 
stoichiometric  fuel.  The  very  low  oxygen  potential  of  hypostoichiometric  fuel  does  not 
allow  complete  gasification  of  available  carbon.  So  C02  pressure  is  very  low.  Therefore 
the  CO  CX)2  mechanisms  would  be  extremely  slow  in  hypostoichiometric  fuels. 
However,  experiments  have  shown  a  rapid  oxygen  redistribution  in  hypostoichiometric 
fuels.  This  indicates  that  other  mechanisms  may  be  responsible.  Therefore,  a  model  of 
oxygen  transport  by  thermal  diffusion  of  oxygen  has  been  proposed.  In  this  model  the 
diffusing  species  are  considered  to  be  single  oxygen  vacancies.  The  heat  of  transport  for 
these  species  being  negative,  they  migrate  towards  high  temperature  region. 

4.3c  Fission  product  redistribution :  The  fission  process  in  nuclear  fuel  results  in 
production  of  atoms  of  more  than  30  elements  and  about  1 20  isotopes  of  these  elements 
have  appreciable  half  lives.  Due  to  steep  temperature  gradient  in  the  fuel  some  fission 
products  move  away  from  the  location  where  they  are  produced.  Loss  or  gain  of  fission 
products  from  the  region  where  they  were  created,  alters  many  important  physical  and 
chemical  properties  of  the  fuel  locally.  The  effect  of  fission  products  on  the  fuel 
behaviour  is  governed  by  their  physical  and  chemical  state.  One  of  the  important 
consequence  of  fission  product  redistribution  is  the  possibility  of  chemical  attack  by 
fuel  on  the  cladding  material. 

Studies  of  fission  product  distribution  in  irradiated  oxide  and  mixed  oxide  fuel  have 
provided  the  following  important  findings:  (i)  Fission  product  elements  Zr,  Nb,  Ce,  Pr, 
Ba  and  Sr  which  exists  as  non-volatile  oxide  in  solid  solution  with  the  fuel,  do  not 
redistribute,  (ii)  Noble  metals  Ru,  Rh,  Pd  and  a  part  of  Mo  exist  in  elemental  form  and 
migrate  significantly.  They  are  observed  as  white  metallic  inclusions  in  the  columnar 
grain  zone  or  as  alloy  in  the  central  void,  (iii)  Volatile  fission  products  Cs,  Rb,  I,  Te,  Sb 
and  Cd  tend  to  migrate  to  the  cooler  regions  in  the  fuel.  Gross  radial  migration  of 
cesium  occurs  from  high  temperature  central  regions  to  cooler  periphery  fuel  and  to  the 
pellet  interface  locations.  Axial  migration  of  cesium  to  the  plenum  is  also  observed.  This 
migration  behaviour  of  cesium  is  attributed  to  its  high  volatility,  (iv)  In  fuels  containing 
appreciable  amount  of  plutonium,  the  fission  yield  of  noble  metals  is  higher  and  the 
yield  of  oxide  forming  elements  is  lower  compared  to  their  yield  in  uranium  fuel.  Due  to 
this  effect,  a  part  of  oxygen  released  by  fission,  remains  in  uncombined  form  in  the  fuel 
and  increases  its  oxygen  potential. 

4.4  Effects  of  fission  gases 

A  substantial  fraction  (15%)  of  the  fission  products  in  nuclear  fuel  consists  of  rare 
gases,  xenon  and  krypton,  which  have  extremely  low  solubility  in  the  fuel  matrix.  If 
these  gases  are  released  from  the  fuel  body  they  increase  the  pressure  inside  the  fuel 
element  subjecting  the  cladding  tubes  to  stress.  On  the  other  hand,  if  they  are  retained  in 
the  fuel,  they  precipitate  as  gas  bubbles  and  cause  fuel  swelling.  The  dimensional 
changes  in  fuel  due  to  swelling  leads  to  mechanical  interaction  between  the  fuel  and 
cladding  which  often  leads  to  pellet  clad  interaction  failure  by  stress  corrosion  cracking 
(figure  15). 

4Aa  Fission  gas  swelling :  Swelling  of  the  fuel  is  attributed  to  the  volume  of  fuel 
occupied  by  the  gas  bubbles.  Bubble  formation  depends  on  the  mobility  of  gas  (either  as 
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Figure  15.  Cross-section  of  a  failed  U02  fuel  element  from  taps  which  was  examined  at 
barc.  The  rupture  of  the  cladding  was  caused  by  stress  corrosion  cracking  (Bahl  et  al  1979). 


single  atom  or  complexes),  the  minimum  number  of  gas  atoms  required  to  form  a  stable 
nucleus  and  the  rate  at  which  lattice  vacancies  can  be  supplied  to  enhance  the  stability  of 
a  nucleated  complex.  In  the  irradiation  environment  existing  in  the  fuel  the  nucleation 
and  growth  of  gas  bubbles  is  accompanied  by  the  irradiation  induced  re-solution  of 
bubbles  by  fission  fragments.  It  is  known  that  bubbles  which  are  stable  out-of-pile  are 
susceptible  to  re-solution  in  irradiation  environment  even  at  temperature  as  high  as 
1200°C. 

The  nucleation  of  bubbles  occurs  heterogeneously  on  the  fission  fragment  tracks 
(Turnbull  1971).  The  nucleation  sites  are  believed  to  be  vacancy-rich  segment  of  the 
fission  fragment  tracks.  Such  nucleation  is  independent  of  thermally  activated 
transport  properties  and  is  therefore  intrinsically  unaffected  by  temperature,  which  is  in 
agreement  with  the  observations  of  Cornell  (1971)  who  showed  that  density  of 
intragranular  bubbles  is  insensitive  to  temperature.  Since  two  xenon  atoms  form  a 
thermally  stable  nucleus,  homogeneous  nucleation  of  gas  bubbles  has  also  been 
considered  but  in  this  case  bubble  density  will  depend  on  temperature  which  is  opposed 
to  the  observations  of  Cornell. 

Bubbles  nucleated  under  irradiation  grow  by  capture  of  gas  atoms  and  vacancies  and 
then  get  dissolved  by  irradiation  induced  re-solution  (Turnbull  1971).  Concentration  of 
intragranular  bubbles  is  sustained  by  fresh  nucleation.  There  is  a  dynamic  equilibrium 
between  formation  of  gas  bubbles  by  nucleation  and  growth  on  one  hand  and 
resolution  of  the  bubbles  by  fission  fragments  on  the  other.  At  temperatures  above 
1 200°C  the  resolution  becomes  less  effective  because  of  the  increased  gas  atom  mobility. 
The  growth  of  bubbles  depends  on  the  temperature  (which  determines  the  gas  diffusion 
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coefficient)  and  burn-up  which  controls  the  dissolved  gas  atom  concentration.  Because 
of  the  re-solution  process  each  bubble  has  an  average  life  defined  by 

r-  [  \/2nr2  F  X\ 

where  r  is  bubble  radius,  F  is  fission  rate  and  X  is  range  of  fission  fragment.  This 
indicates  that  the  bubble  growth  is  restricted.  However,  the  observed  presence  of  large 
gas  bubbles  in  the  fuel  suggested  that  bubbles  larger  in  size  are  more  stable  because  the 
re-solution  is  not  so  effective  in  destroying  large  bubbles  as  it  is  with  small  bubbles. 
Turnbull  and  Cornell  (1970)  showed  that  bubbles  with  diameter  greater  than  24  A  have 
enhanced  stability  as  they  cannot  be  destroyed  in  a  single  event  of  bubble-fission 
fragment  interaction. 

The  equilibrium  number  of  bubble  per  unit  volume,  n  in  the  fuel  is  given  by 
n/T=2F<x, 

where  a  is  number  of  bubbles  nucleated  per  fission  fragment,  T  and  F  have  the  same 
meaning  as  in  earlier  equation. 

The  equilibrium  quantity  of  gas  m  in  intragranular  bubbles  per  unit  volume  of  fuel  is 
given  by  following  correlation  (Turnbull  1971): 

m  =  (4a/3A6')  (2b'CD/nFX)114, 

where  b'  is  Van  der  Waals  constant  for  fission  gas,  C  is  dissolved  gas  atom  concentration, 
D  is  gas  diffusion  coefficient,  F,  X  and  a  have  same  meaning  as  earlier.  This  indicates  that 
the  quantity  of  gas  in  intragranular  bubbles  increase  as  fourth  root  of  the  dissolved  gas 
concentration.  It  is  clear  that  the  fraction  of  gas  in  the  intragranular  bubble  is  invariably 
a  small  fraction  of  the  entire  gas  content.  The  importance  of  this  fact  is  that  the  gas  from 
the  matrix  migrate  to  grain  boundary  as  single  atom  and  forms  intergranular  bubbles. 
In  fact  a  major  part  of  the  swelling  observed  in  fuel  is  caused  by  formation  and  growth 
of  large  intergranular  bubbles  in  the  process  of  gas  release. 

Metallographic  observations  (Chubb  1972)  on  U02  irradiated  at  temperature 
below  1600°C  showed  an  equiaxed  grain  structure  with  extensive  porosity  along  the 
boundaries  where  fuel  temperatures  was  sufficiently  high.  This  porosity  accounts  for 
the  majority  of  the  fuel  swelling.  Detailed  investigations  of  intergranular  fractured 
surfaces  show  the  characteristics  of  bubbles  lying  on  grain  faces  and  those  situated  at 
grain  edges  and  corners.  The  latter  are  observed  to  be  extensively  developed  and 
generally  occupy  a  larger  volume  than  the  relatively  small  grain  face  bubbles. 

4.46  Fission  gas  release :  Release  of  fission  gases  from  the  fuel  to  the  free  volume  of  the 
fuel  elements  is  governed  by  the  mobility  of  the  gas  as  individual  atoms  or  as  bubbles  in 
the  existing  condition  during  irradiation.  The  release  of  fission  gases  is  observed  to  be 
strongly  dependent  on  temperature.  Other  factors  which  influence  the  release  are  grain 
size,  burn-up,  fission  rate,  fraction  of  open  porosity,  fuel  density  and  stoichiometry  of 
fuel  in  U02. 

Xenon  is  found  to  diffuse  in  U02  as  a  complex  of  constant  size.  This  complex  consists 
of  a  xenon  atom  bound  with  one  uranium  vacancy*  and  two  oxygen  vacancies.  The 
diffusion  of  xenon  has  been  studied  by  Cornell  (1969).  It  is  found  that  xenon  has  no 
significant  mobility  in  U02  at  temperatures  below  1000°C.  Migration  of  bubbles  may 
occur  by  several  processes.  The  effective  diffusion  coefficient  for  migration  of  gas 
bubble  at  a  constant  temperature  is  dependent  on  its  radius  (r)  and  is  proportional  to 
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r~n  (Gittus  1978)  where  the  exponent  is  dependent  on  the  physical  process  responsible 
for  bubble  movement.  The  value  of  n  is  4  for  surface  diffusion,  3  for  volume  diffusion 
and  2  when  bubble  movement  occurs  by  diffusion  of  atoms  in  a  vapour  phase  across 
bubble  filled  with  equilibrium  pressure  of  insoluble  gas.  It  is  found  that  under  the  effect 
of  temperature  gradient  surface  diffusion  is  the  dominant  mechanism  of  migration  for 
bubbles  with  radii  less  than  16  microns.  Bubbles  of  larger  (>  16  micron)  size  move  by 
vapour  transport  mechanism.  However,  no  bubble  migration  is  observed  at  tempera¬ 
tures  below  1 500°C  and  the  main  mode  of  gas  mobility  upto  1 500°C  remains  atomic 
diffusion.  The  method  of  predicting  gas  release  in  terms  of  the  diffusion  coefficient 
alone  is,  however,  not  reliable  because  of  the  existence  of  different  types  of  traps  like 
dislocations,  grain  boundaries,  bubbles,  lattice  defects,  defects  clusters,  precipitates  of 
fission  products.  Moreover,  sudden  changes  in  gas  release  with  changes  in  reactor 
power  have  been  observed  which  could  not  be  accounted  for  by  the  diffusion  processes. 
‘Gas  burst’  is  observed  to  occur  at  start-up  and  shutdown  and  during  power  transients. 

Two  basic  mechanisms  of  gas  release  operate  below  1000°C.  They  are  based  on  two 
collision  processes,  (a)  ‘recoil’  in  which  a  fission  event  near  the  surface  produces  a  gas 
atom  which  has  sufficient  energy  to  escape  from  the  solid,  (b)  ‘knock-out’  where  a 
fission  fragment  collides  with  a  gas  atom  with  sufficient  energy  to  knock  it  out  of  the 
solid. 

In  the  temperature  range  1 000-1 600°C  the  release  occurs  by  atomic  diffusion  of  gas 
atoms  from  grain  interior  to  the  grain  boundaries  (figure  16)  where  the  ready  supply  of 
vacancies  permits  the  gas  atoms  to  form  gas  bubbles  large  enough  to  survive  the 
re-solution  effect.  Bubbles  on  the  grain  boundary  grow  by  collection  of  atoms  from 
grain  interior.  If  the  temperature  is  sufficiently  high  to  cause  grain  growth,  the  grain 
boundary  sweeping  of  the  fission  gas  atoms  from  the  fuel  provides  additional  gas  at  the 
boundary.  The  grain  boundary  bubbles  grow  at  a  rate  determined  by  the  supply  of  gas 
atoms  at  the  boundary.  A  point  is  reached  when  the  bubbles  are  large  enough  to 
interlink  with  each  other  representing  grain  boundary  saturation  with  fission  gases 
forming  a  continuous  channel  for  release  of  the  gas  to  open  surfaces.  Since  the  grain 
boundary  saturation  is  a  prerequisite  for  release  of  gases  in  this  mechanism,  there  has  to 
be  an  incubation  period  (threshold  burn-up)  before  the  gas  release  occurs.  Threshold 
burn-up  for  gas  release  is  a  function  of  temperature;  higher  the  temperature 
lower  the  burn-up  when  release  starts.  Several  models  of  fission  gas  release 
(Collins  et  al  1973;  Notley  and  Hastings  1980)  have  been  developed  using  the  above 
mechanism. 

At  temperature  above  1600°C  almost  all  the  fission  gas  generated  in  the  fuel  is 
released.  The  release  occurs  by  movement  of  gas  bubbles  which  are  sufficiently  mobile 
at  such  high  temperatures.  The  closed  pores  in  the  fuel  also  move  up  the  gradient  and 
sweep  the  fission  gases  on  their  way  to  the  fuel  centre  where  they  are  released  to  the 
central  void.  The  above  mechanism  of  pore  and  bubble  migration  for  gas  release  is  also 
responsible  for  formation  of  radially  oriented  columnar  grains  in  the  fuel. 

In  spite  of  large  amount  of  theoretical  and  experimental  work  on  fission  gas  release 
behaviour  in  ceramic  fuels,  the  understanding  of  the  mechanism  is  not  complete.  There 
is  often  a  wide  gap  between  the  predictions  and  experimental  observations  of  fission  gas 
release  especially  in  transient  tested  fuel  elements  (Pickman  1984).  Intensive  research  in 
this  field,  however,  unveiled  some  interesting  phenomena  in  mid-sixties.  A  scientifically 
important  discovery  was  that  fission  events  could  cause  fission  gas  bubbles  to  redissolve 
into  the  lattice. 
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Figure  16.  The  microstructure  of  an  irradiated  U02  fuel  from  taps  showing  presence  of 
fission  gas  bubbles  on  the  grain  boundary. 


4.5  Irradiation  creep 

Irradiation  creep  plays  a  very  important  role  in  controlling  the  strain  rate  of  cladding 
when  fuel-cladding  mechanical  interaction  starts  in  fuel  elements  at  high  burn-up.  The 
effect  of  irradiation  on  creep  of  ceramic  fuels  is  two-foid  (a)  to  enhance  the  thermal 
creep,  and  (b)  to  develop  creep  under  condition  in  which  thermal  creep  is  absent.  U02 
exhibits  creep  even  at  temperatures  below  1100  C  during  irradiation  (figure  17).  The 
irradiation  induced  creep  rate  is  observed  to  be  a  linear  function  of  stress  and  fission 
rate.  Several  mechanisms  have  been  proposed  for  irradiation  creep  in  ceramic  fuels 
(Dienst  1977;  Gilbert  1971).  The  mechanisms  proposed  for  irradiation  creep  fall  in 
three  categories  (i)  mechanisms  based  on  diffusional  creep  (ii)  mechanisms  based  on 
fission  spike  effect  in  fuel  (iii)  mechanisms  based  on  growth  of  dislocation  loops. 

It  was  earlier  thought  that  increase  in  vacancy  concentration  resulting  from 
irradiation  may  enhance  the  volume  self-diffusion  term  in  Nabarro-Herrmg  creep 
model  and  this  was  responsible  for  the  enhancement  of  creep  rate  in  the  fuel.  However  it 
was  shown  (Olander  1 976)  that  creep  rate  in  Nabarro-Herring  creep  model  does  not  get 
enhanced  because  the  defects  produced  by  radiation  in  the  grain  are  either  annihilated 
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T,  100  °C 


Figure  17.  Irradiation  creep  rate  of  U02  as  a  function  of  temperature  (Solomon  et  al  1971). 


or  flow  equally  to  the  grain  boundary.  Brucklacher  considered  that  the  point  defects 
formed  during  irradiation  anneal  at  the  defects  clusters  present  inside  the  grains.  He 
replaced  the  grain  of  Nabarro-Herring  model  by  defect  clusters  of  1000  A  size  and  was 
able  to  predict  irradiation  creep  which  was  in  agreement  with  the  experiment.  Another 
modified  diffusional  model  presented  by  Brucklacher  et  al  (1970)  for  irradiation  creep 
considered  diffusion  of  vacancies  between  dislocation  loops  formed  during  irradiation 
having  different  orientations  relative  to  the  stress  direction.  Tensile  stress  causes 
growth  of  loops  which  are  normal  to  the  stress,  at  the  expense  of  loops  parallel  to  the 
stress  direction. 

One  of  the  spike-based  mechanisms  proposed  that  creep  is  accelerated  by  transient 
pulse  heating  of  fuel  by  fission  spikes.  As  the  material  in  cylindrical  region  around  the 
axis  of  the  fission  fragment  track  is  momentarily  brought  to  liquid  state  there  is  a  very 
large  increase  in  the  vacancy  diffusion  coefficient  which  gives  acceleration  of  creep. 
Perrin  has  proposed  another  mechanism  based  on  spike  effect  which  is  applicable  to 
irradiation  enhanced  creep  in  fuel.  In  this  mechanism  the  fuel  undergoes  continuous 
annealing  due  to  fission  spikes;  strain  hardening  is  prevented  and  fuel  creeps 
continuously  in  primary  stage  (figure  1 8).  The  creep  rate  is  given  by  the  primary  thermal 
creep  rate  and  is  governed  by  temperature  and  fission  rate.  Creep  is  considered 
enhanced  with  respect  to  the  secondary  thermal  creep  rate  which  is  normally  lower  than 
primary  creep  rate  due  to  strain  hardening.  Besides  the  above  mechanism  the  elastic 
stress  relaxation  mechanism  described  in  §3.2  has  also  been  proposed  for  ceramic  fuels. 

Dislocation-based  creep  mechanisms  consider  the  formation  and  growth  of 
dislocation  loops  under  irradiation.  It  is  considered  that  under  the  influence  of  stress 
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Figure  18.  Maintenance  of  primary  creep  mechanism  by  fission  spike  annealing  (Dienst 
1977). 

the  irradiation-produced  defect  clusters  collapse  into  dislocation  loops  preferentially 
on  certain  planes.  Appropriately  oriented  loops  grow  by  capture  of  interstitials.  The 
creep  rate  in  this  mechanism  is  given  as  a  function  of  stress  and  fission  rate. 


5.  Prediction  of  fuel  element  behaviour 


The  behaviour  of  nuclear  fuel  elements  during  irradiation  is  a  function  of  large  number 
of  processes  which  take  place  simultaneously  and  interact  with  each  other  in  a  complex 
manner.  Figure  19  shows  the  complicated  interrelation  between  various  physical, 
chemical,  mechanical  and  metallurgical  phenomena  which  occur  inside  a  water  reactor 
fuel  element.  To  predict  the  irradiation  behaviour  of  the  fuel  element  a  computer  model 
is  needed  which  represents  the  interactive  processes  shown  in  this  figure  and  allows  a 
quantitative  estimation  of  the  evolution  of  changes  in  the  fuel  element.  The  computer 
model  incorporates  all  the  contemporary  data  of  the  material  properties  and  the  latest 
understanding  of  the  phenomena  and  their  mutual  interactions.  It  is  a  very  useful  tool 
for  design  of  fuel  elements,  safety  analysis  and  for  the  interpretation  of  the  results  of 
irradiation  experiments. 

The  analysis  performed  by  fuel  element  computer  models  can  be  grouped  in  two 
categories:  thermal  analysis  and  mechanical  analysis.  The  thermal  analysis  concerns  the 
calculations  of  the  fuel  temperature  distribution  in  the  fuel  set  up  by  the  volumetric  heat 
generation.  In  a  cylindrical  fuel  element,  the  steady  state  radial  temperature  distri¬ 
bution  is  obtained  by  solution  of  following  equation  with  appropriate  boundary 
conditions. 


£(■*£) 


+  H  =  0, 


where  k  is  the  thermal  conductivity  of  fuel  which  is  a  function  of  temperature  and 
depends  on  fuel  density.  H  is  volumetric  heat  generation  rate.  The  heat  generation  rate 
in  thermal  reactor  fuel  elements  is  not  uniform  along  the  fuel  radius  owing  to  thermal 
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Figure  19.  Interrelation  of  physical,  chemical  and  metallurgical  processes  operating  in  a 
ceramic  fuel  element  during  irradiation  (Sah  et  al  1983). 


neutron  flux  depression  towards  the  fuel  centre.  With  the  progress  of  irradiation,  the 
magnitude  and  radial  variation  of  fuel  thermal  conductivity  (k)  and  heat  generation  rate 
(H)  changes  with  time  due  to  several  irradiation  effects  e.g-  porosity  redistribution, 
fissile  material  segregation,  restructuring  etc.  The  temperature  distribution  is  evaluated 
at  small  time  intervals  accounting  for  the  irradiation  effects  in  the  fuel  element. 

One  important  aspect  of  calculation  Of  temperature  distribution  in  fuel  is  the  heat 
transfer  across  the  fuel  cladding  gap  or  interface.  The  fuel  centre  temperature  is  very 
sensitive  to  the  gap  conductance  which  governs  the  temperature  drop  in  the  fuel-clad 
gap.  The  heat  transfer  coefficient  ( h )  for  conduction  of  heat  through  the  gas  filled  gap  is 
given  by 

is 

h  =  - gas - 

^gap  +  0i+0/  +  <5  ’ 

where  K  is  the  thermal  conductivity  of  the  gas,  X  is  the  fuel  clad  gap  in  operating 

o  o  i 

conditions,  S  is  the  sum  of  surface  roughness  of  the  fuel  and  the  cladding,  gf  and  gt  are 
jump  distance  of  the  gas  in  fuel  and  cladding.  The  value  of  K  decreases  considerably 
by  the  release  of  fission  gas  to  the  fuel  clad  g^p  since  xenon  has  a  very  low  thermal 
conductivity  compared  to  helium  which  is  used  as  a  filler  gas  in  the  fuel  element.  The 
value  of  Xgap  during  irradiation  is  controlled  by  the  processes  responsible  for  gap 
closure  e.g.  densification,  swelling  and  creep.  Uncertainty  in  the  prediction  of  gas 
release  and  fuel  clad  gap  leads  to  uncertainty  in  the  fuel  temperature. 
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The  mechanical  analysis  is  carried  out  assuming  axisymmetry  and  plane  strain 
conditions.  The  total  strain  in  the  fuel  and  the  cladding  is  considered  to  be  composed  of 
elastic,  thermal,  swelling,  creep  and  plastic  strains.  The  constitutive  equations  used  in 
the  modelling  are 


£r  =  —  [pr  -  v(<T0  +  (Tz)]  +  g{h  4-  eSr  4-  £Cr  +  g?1, 

E 

S«  =  ]7  K  -  V(<7r  +  <0]  +  4h  +  4  +  4  +  «J‘, 

^  [_Oz  ~  v(<Tr  +  ff,)]  +  «ih  +  Ss2  +  +  £?', 

where  v  is  the  Poisson’s  ratio,  er,  ee  and  ez  are  strains  in  principal  directions  or,  oe,  oz  are 
stresses  in  principal  directions,  E  is  elastic  modulus,  gth  is  thermal  strain,  8s  is  strain  due 
to  swelling,  ec  is  creep  strain  and  gP1  is  plastic  strain.  Swelling  and  thermal  strains  are 
isotropic  and  hence  ejh  =  4h  =  g *h  =  ctT  where  a  is  linear  thermal  expansions  coefficient 
and 

g*  =  es0  =  esz  =  i  (volumetric  swelling). 


Creep  and  plastic  strains  are  calculated  using  equivalent  stress  o*  which  is  defined  as 


<7*  = 


1 


[(<7r  -  a 0)2  4-  (c Te  -  oz)2  -1-  {or  -  az)2] 


211/2 


Though  the  constitutive  equations  remain  the  same,  codes  developed  may  differ  in 
degree  of  details,  numerical  methods  used  and  processes  modelled. 

The  fuel  modelling  codes  developed  to  describe  the  behaviour  of  nuclear  fuel 
elements  fall  in  two  general  groups.  In  the  first  category,  the  models  are  more 
phenomenological  and  consist  of  a  fast-running,  empirical  models  of  processes  to 
describe  the  fuel  behaviour.  In  the  second  category  of  codes,  the  models  are  treated  in 
more  detail  and  include  the  various  processes  which  are  considered  to  control  the 
behaviour  of  the  fuel  elements.  The  main  objective  of  such  codes  is  to  relate  the  material 
properties  and  physical  mechanism  to  the  widest  possible  range  of  fuel  element 
geometries,  material  parameters  and  operating  conditions.  Because  of  the  firm  physical 
basis  for  the  models,  these  codes  allow  extrapolation  over  a  wide  range  of  conditions. 

A  number  of  codes  have  been  developed  for  nuclear  fuel  elements  all  over  the  world. 
Some  of  the  well  known  fuel  modelling  codes  are  comethe  (Belgium),  elesim  (Canada), 
minipat  and  sleuth  seer  (uk),  uranus  (frg),  life  (usa).  In  India  two  computer  codes 
have  been  developed  for  analysis  of  water  reactor  fuel  element  performance,  profess 
(Sah  and  Venkatesh  1983,  1984)  and  comta  (Anand  et  al  1980).  profess  is  used  for 
analysis  and  interpretation  of  post-irradiation  examination  results  obtained  on  fuel 
elements  (Roy  et  al  1984). 


6.  Summary 

A  review  of  the  important  effects  observed  in  metallic  uranium  and  ceramic  nuclear 
fuels  during  irradiation  has  been  made.  The  mechanism  of  the  processes  occurring  in 
the  fuel  during  irradiation  has  been  discussed.  The  basic  cause  of  several  effects  in  fuel 
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arises  from  the  effects  of  highly  ionised  fission  fragments  and  high  energy  neutrons.  The 
fission  fragments  give  rise  to  fission  spike,  lattice  defects  and  impurity  atoms  in  the  fuel. 

Alpha  uranium  exhibits  dimensional  instability  during  irradiation.  This  is  caused  by 
three  irradiation-induced  phenomena  namely  irradiation  growth  (shape  change  at 
constant  volume  without  any  external  stress  application),  irradiation  creep  (deforma¬ 
tion  under  small  external  stress)  and  swelling  (due  to  solid  and  gaseous  fission 
products).  Irradiation  growth  of  uranium  is  related  to  the  anisotropy  of  alpha  uranium 
crystal  and  the  mechanism  of  growth  involves  formation  of  extra  layers  of  interstitial 
atoms  on  certain  crystallographic  planes. 

A  very  severe  thermal  and  radiation  environment  (fuel  temperatures  approaching 
melting  point  and  thermal  gradients  in  the  range  3000-5000°C/cm)  initiates  a  number 
of  interactive  processes  in  the  ceramic  fuel  during  irradiation.  Several  spectacular 
changes  in  geometry,  dimension,  chemistry  and  microstructure  of  the  fuel  are  observed. 
Shrinkage  and  dissolution  of  as-fabricated  pores  by  fission  fragments  leads  to  fuel 
densification  in  the  early  stages  of  fuel  irradiation.  The  initial  microstructure  of  the  fuel 
gets  altered  and  several  restructured  radial  zones  are  observed  in  fuel  cross-sections. 
One  of  the  important  microstructural  changes  is  the  formation  of  high  density 
columnar  grain  by  movement  of  porosity  towards  the  centre.  Plutonium  which  is 
initially  distributed  uniformly  in  the  fuel  is  observed  to  segregate  to  preferred  locations 
(towards  fuel  centre  or  away  from  fuel  centre)  depending  upon  the  initial  stoichiometry 
of  the  fuel.  Oxygen  in  the  fuel  gets  redistributed  to  maintain  the  thermodynamic 
equilibrium  between  C02,  CO  and  oxygen  in  the  existing  radial  temperature  gradient. 
Many  fission  products  especially  the  noble  metals  and  volatile  elements  produced  in  the 
fuel  are  also  observed  to  redistribute  themselves.  A  large  fraction  of  fission  products  in 
fuel  is  the  fission  gases  xenon  and  krypton  which  are  insoluble  in  the  fuel  matrix.  The 
fission  gases  precipitate  as  gas  bubbles  in  the  fuel  and  cause  fuel  swelling.  A  part  of  the 
fission  gas  is  released  from  the  fuel.  The  gas  release  occurs  by  three  mechanisms:  recoil 
and  knock  out  (below  1000°C);  atomic  diffusion  to  the  grain  boundary  (1 000-1 600°C) 
and  bubble  migration  to  the  fuel  centre  (above  1600°C).  Another  important  pheno¬ 
mena  observed  in  ceramic  fuel  is  irradiation  creep.  Irradiation-induced  creep  is  found 
to  take  place  in  U02  at  a  low  temperature  ( <  1 100°C)  where  thermal  creep  is  absent. 
Three  types  of  mechanisms  have  been  proposed:  diffusion  based,  spike  based  and 
dislocation  based.  A  satisfactory  model  of  irradiation  creep  in  ceramic  fuel  is  yet  to  be 
developed. 
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